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Novel  numerical  methods  for  optimal  control  and  optimization  problems 
involving  fractional-order  differential  equations 

US  Airforce  Project  15IOA095 

Principal  Investigators:  Song  Wang  and  Volker  Rehbock 

Department  of  Mathematics  &  Statistics 

Curtin  University 


Aims  of  the  project 

The  aim  of  this  project  is  to  develop  numerical  solution  methods  for  optimal  control 
problems  which  are  subject  to  systems  of  fractional  differential  equations.  These  systems 
yield  more  accurate  representations  of  many  real  world  systems  and  can  incorporate  a  more 
global  view  of  the  system  state.  Amongst  other  advantages,  this  allows  modellers  to  include 
features  such  as  memory  effects  in  either  space  or  time.  As  fractional  order  systems  require 
quite  distinct  numerical  solution  methods,  it  is  a  major  task  to  develop  numerical  methods  for 
both  optimal  open  and  closed  loop  control  problems.  Our  aim  is  to  develop  effective  and 
efficient  numerical  methods  for  the  construction  of  solutions  to  fractional  order  optimal 
control  problems.  As  very  few,  if  any,  such  methods  currently  exist,  our  work  will  enable 
future  researchers  and  practitioners  to  address  and  solve  practically  important  optimal  control 
and  optimization  problems  involving  fractional-order  differential  equations. 


Activities  and  achievements  within  this  project 

This  project  officially  started  on  23  September  2015  and  ended  on  22  September  2017. 
However,  our  investigation  and  research  activities  in  the  area  started  early  2015  since  we  first 
proposed  the  project.  The  research  associate,  Dr  Wen  den  Hollander  started  her  employment 
on  24  October  2015.  The  past  two  year  are  very  fruitful  years  during  which  we  have  studies 
various  optimal  control  and  optimization  problems  and  their  applications  arising  in  both 
control  engineering  and  financial  engineering.  Various  research  activities  have  been 
supported  by  the  project  in  the  as  outlined  below. 


1.  A  2nd-order  one-point  numerical  integration  scheme  has  been  developed  and  analysed 
for  solving  fractional  dynamical  systems  which  is  an  integral  part  of  optimal  control 
problem.  Efficient  and  accurate  numerical  methods,  essential  for  solving  fractional 
optional  control  problems,  are  scarce  in  the  open  literature.  In  this  paper  we  propose 
an  efficient  and  easy-to- implement  numerical  method  for  an  a-th  order  ordinary 
differential  equation  when  a  G  (0,1),  based  on  a  one-point  quadrature  rule.  The 
quadrature  point  in  each  sub-interval  of  a  given  partition  with  mesh  size  h  is  chosen 
judiciously  so  that  the  degree  of  accuracy  of  the  quadrature  rule  is  2  in  the  presence  of 
the  singular  integral  kernel.  The  resulting  time-stepping  method  can  be  regarded  as 
the  counterpart  for  fractional  ODEs  of  the  well-known  mid-point  method  for  the  1st- 
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order  ODEs.  We  show  that  the  global  error  in  a  numerical  solution  generated  by  this 
method  is  of  the  2nd-order  accuracy,  independently  of  a.  An  extension  of  this  method 
to  dynamical  systems  involved  in  optimal  control  problems  has  been  discussed. 
Numerical  results  are  presented  to  demonstrate  that  the  computed  rates  of 
convergence  match  the  theoretical  one  very  well  and  that  our  method  is  much  more 
accurate  than  a  well-known  one-step  method  when  a  is  small. 

A  research  paper  containing  the  developed  theoretical  and  numerical  results  has  been 
published  in  an  international  journal. 

2.  A  numerical  algorithm  combining  a  generalization  of  the  algorithm  in  Item  2  above  to 
a  system  of  equations  and  a  gradient -based  method  is  developed  for  solving  general 
fractional  optimal  open-loop  control  problems  ( a  6  (0,1))  with  multiple  states  and 
control  variables.  This  algorithm  has  the  merit  that  it  has  a  2nd"order  convergence  rate 
and  is  computationally  efficient,  and  thus  can  handle  large-scale  fractional  optimal 
control  problems.  A  gradient  formula  has  been  developed  which  forms  the  basis  of  the 
numerical  method  for  multiple  state  and  control  problems.  Convergence  of  the 
method  has  been  proven.  The  combined  method  has  been  coded  using  Matlab 
programming  language  and  extensive  numerical  experiments  have  been  conducted  to 
demonstrate  the  performance  of  the  method  using  optimal  control  problems  with 
multiple  states  and  controls.  The  numerical  results  show  that  the  numerical  scheme 
developed  in  this  project  is  able  to  solve  fractional  optimal  control  problems  of 
practical  significance. 

A  research  paper  containing  the  detailed  description  of  the  method  and  numerical 
experimental  results  has  been  submitted  to  an  international  journal  for  publication. 

3.  A  2nd-order  finite-difference  method  for  a  fractional  differential  complementarity 
(variational  inequality)  problem  of  order  a  G  (1,2)  arising  from  the  stochastic  optimal 
feedback  (closed  loop)  control  in  financial  engineering.  In  this  work  we  have 
designed  the  finite-difference  method  for  solving  the  2nd-order  fractional  partial 
differential  equation  and  showed  that  the  truncation  error  of  the  method  is  of  2nd-order. 
Numerical  experiments  have  been  performed  to  demonstrate  the  accuracy  and 
efficiency  of  the  method.  Dr.  Song  Wang  presented  the  results  as  a  plenary  speaker  at 
the  6th  Conference  on  Numerical  Analysis  &  Applications  held  in  June,  2016  in 
Lozenets,  Bulgaria. 

Two  research  papers  have  been  in  published  respectively  in  an  edited  volume  of 
Lecture  Notes  in  Computer  Science  and  an  international  journal. 

4.  Numerical  solution  of  a  high-dimensional  Hamilton-Jacobi-Bellman  (HJB)  equation 
arising  from  an  optimal  control  feedback  problem  in  engineering.  In  this  paper  we 
propose  a  combination  of  a  penalty  method  and  a  finite  volume  scheme  for  a  four¬ 
dimensional  time-dependent  (HJB)  equation  arising  from  a  stochastic  optimal  control 
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problem  in  pricing  financial  options  with  proportional  transaction  costs  and  stochastic 
volatility.  The  HJB  equation  is  first  approximated  by  a  nonlinear  differential  equation 
containing  penalty  terms.  A  finite  volume  method  along  with  an  upwind  technique  is 
then  developed  for  the  spatial  discrtization  of  the  nonlinear  penalty  equation.  We 
show  that  the  coefficient  matrix  of  the  discretized  system  is  an  M-matrix.  An  iterative 
method  is  proposed  for  solving  the  nonlinear  algebraic  system  and  a  convergence 
theory  is  established  for  the  iterative  method.  Numerical  experiments  are  performed 
using  a  non-trivial  model  pricing  problem  and  the  numerical  results  demonstrate  the 
usefulness  of  the  proposed  method. 

5.  During  the  period  of  this  project,  S.  Wang  has  also  in-kind  contributions  towards  the 
development  of  efficient  numerical  methods  for  the  optimal  control  of  robots. 


Use  of  funds 

The  funds  have  mostly  been  used  for  the  employment  of  the  research  associate,  Dr.  W.  den 
Hollande.  Dr.  S.  Wang’s  travels  to  the  Bulgarian  conference  to  deliver  his  plenary  address 
was  also  partially  supported  by  the  project.  Dr.  S.  Wang  has  also  travelled  to  HK  in 
December  2017,  supported  by  Curtin  University  and  this  project,  to  deliver  an  invited  talk 
entitled  ‘Numerical  solution  of  fractional  optimal  control  problems’  at  ‘The  Workshop  on 
Variational  Analysis  &  Stochastic  Optimization’  organized  by  HK  Polytechnic  University. 


Publications  and  reports  within  this  project 
(W.  Ui  is  the  maiden  name  of  W.  den  Hollander.) 

1.  W.  Li,  S.  Wang,  V.  Rehbock,  Numerical  solution  of  fractional  optimal  control, 
submitted  for  publication. 

2.  W.  Li,  S.  Wang,  V.  Rehbock,  A  2nd-order  one-point  numerical  integration  scheme  for 
fractional  ordinary  differential  equations,  Numerical  Algebra,  Control  &  Optimization, 
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3.  W.  Chen,  S.  Wang,  A  2nd-Order  FDM  for  a  2D  Fractional  Black-Scholes  Equation. 
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5.  W.  Li,  S.  Wang,  Pricing  European  options  with  proportional  transaction  costs  and 
stochastic  volatility  using  a  penalty  approach  and  a  finite  volume  scheme,  Computer 
&  Mathematics  with  Applications,  Vol.73,  2454-2469  (2017). 


DISTRIBUTION  A.  Approved  for  public  release:  distribution  unlimited. 


6.  M.  Tan,  L.S.  Jennings,  S.  Wang,  Analysing  human  periodic  walking  at  different 
speeds  using  parametrization  enhancing  transform  in  dynamic  optimization,  Pacific 
Journal  of  Optimization,  Vol.12,  557-586  (2016). 


Attachments:  Papers  and  reports  listed  above. 


DISTRIBUTION  A.  Approved  for  public  release:  distribution  unlimited. 


Numerical  solution  of  fractional  optimal  control 

Wen  Li,  Song  Wang  and  Volker  Rehbock 

Department  of  Mathematics  &  Statistics 
Curtin  University,  GPO  Box  U1987,  Perth  WA6845,  Australia 
wen. li@ cur tin. edu. au;  song. wang@ cur tin. edu. au 
V . Rehbock@curtin . edu . au 


Abstract 

This  paper  presents  a  numerical  algorithm  for  solving  a  class  of  optimal  con¬ 
trol  problems  with  a  dynamic  system  containing  fractional  differential  equations. 
We  first  propose  a  robust  2nd-order  numerical  integration  scheme  for  the  fractional 
system,  based  a  set  of  judiciously  chosen  quadrature  points.  The  objective  is  ap¬ 
proximated  by  the  trapezoidal  rule.  We  then  apply  a  gradient-based  optimization 
method  to  solve  the  discretized  optimal  control  problem.  Formulas  for  calculat¬ 
ing  the  gradients  with  respect  to  the  unknown  discrete  control  values  are  derived. 
Computational  results  demonstrate  that  the  proposed  method  is  able  to  generate 
good  numerical  approximations  for  optimal  problems  with  multiple  state  and  con¬ 
trol  variables.  The  results  also  show  that  the  method  is  robust  with  respect  to  the 
fractional  orders  of  derivatives  involved  in  the  dynamics. 


1  Introduction 

A  fractional  order  optimal  control  problem  (FOCP)  involves  dynamics  which  are  described 
by  fractional  differential  equations.  In  the  last  decade,  fractional  order  optimal  control 
problems  have  arisen  in  many  fields  such  as  mathematics,  engineering,  biology,  economics, 
finance  and  management.  Various  methods  have  been  developed  for  solving  these  prob¬ 
lems  (see,  for  example, [1,  2,  3,  4,  5,  7,  12,  18,  19,  25,  26,  32,  37]).  In  [1,  2],  Agrawal 
extended  the  classical  control  theory  to  fractional  dynamic  systems  and  derived  fractional 
Euler- Lagrange  equations  for  FOCPs.  These  equations  give  the  necessary  conditions  of 
optimality  for  unconstrained  FOCPs.  The  fractional  Euler- Lagrange  equations  have  been 
solved  numerically  in  [1,  3,  5]  where  the  performance  index  is  assumed  to  be  a  quadratic 
function.  Based  on  the  work  of  [1,  2],  Singha  and  Nahak  [36]  derived  necessary  optimality 
conditions  for  a  class  of  FOCP,  where  the  dynamical  constraints  comprise  a  combination 
of  classical  and  fractional  derivatives.  In  [4,  7,  12,  18,  19,  26,  32,  37]),  the  authors  solved 

*This  work  is  supported  by  the  AOARD  Project  #  15IOA095  from  the  US  Air  Force. 

^Submitted  to  an  international  journal  for  publication. 
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the  FOCPs  directly  without  reference  to  the  necessary  optimality  conditions  of  the  con¬ 
tinuous  problem.  Tricand  and  Chen  [37]  converted  FOCPs  into  a  general,  rational  form  of 
optimal  control  problem  by  a  rational  approximation  method.  In  [4,  7,  12,  18,  19,  26,  32]) 
the  authors  used  polynomial  approximations  of  the  state  and  control  to  solve  an  FOCP. 
In  these  methods,  they  first  derived  an  operational  matrix  for  the  fractional  derivatives 
based  on  the  polynomial  approximation.  Then,  the  system  of  equations  derived  from  the 
dynamic  constraints  was  adjoined  to  the  performance  index.  By  deriving  the  necessary 
conditions  for  the  optimality  of  the  performance  index,  the  given  FOCP  reduces  to  a 
problem  of  solving  a  system  of  algebraic  equations  which  can  be  solved  by  an  iterative 
method.  Bernstein  polynomials  [4,  32],  Jacobi  polynomials  [12,  18],  Legendre  polynomials 
[26,  19]  and  Chebyshev  polynomials  [6]  have  been  used  in  these  papers. 

Although  many  researchers  have  studied  FOCPs,  most  of  them  considered  only  one¬ 
dimensional  FOCPs  involving  one  state  variable  and  one  control  variable.  Recently, 
Alipour  et  al.  [4]  and  Bhrawy  et  al.  [7]  developed  numerical  schemes  for  multi-dimensional 
FOCPs.  In  [4],  the  authors  considered  a  FOCP  in  which  the  performance  index  and  the 
constraint  conditions  of  fractional  differential  equations  are  polynomial  functions  of  the 
state  and  control  variables.  Bhrawy  et  ah  [7]  solved  a  multi-dimensional  FOCP  with 
a  quadratic  performance  index  and  linear  fractional  dynamic  constraints.  Both  of  these 
papers  used  polynomial  approximation  methods  for  solving  the  FOCPs.  In  [4],  Alipour 
et  ah  used  Bernstein  polynomials,  whereas  Bhrawy  et  ah  [7]  used  orthonormal  Legendre 
polynomials.  It  is  well  known  that  approximation  of  the  solution  to  a  differential  equation 
by  high-order  polynomials  often  results  in  ill-conditioned  algebraic  systems  and  numerical 
instability.  To  our  best  knowledge,  there  are  no  numerical  methods  in  the  open  literature 
for  general  FOCPs  with  multiple  states  and  controls  which  are  comparable  to  popular 
existing  numerical  methods  for  conventional  optimal  control  problems. 

There  are  two  commonly  used  definitions  of  a  fractional  derivative:  the  Riemann- 
Liouvillc  and  the  Caputo  fractional  derivative  representations  [34],  In  the  paper,  we  use 
the  Caputo  fractional  derivative  which  is  defined  as  follows. 

Definition  1.1  Assume  that  y(t)  is  differentiable  on  [0,  oo)  for  a  positive  constant  T  and 
0  <  (3  <1.  The  Caputo  derivative  of  order  [3  of  the  function  y(t)  is  defined  as 


o  D%y(t) 


_j —  r 

r(i-/3)  Jo 


y'(r) 

(t  ~  t)P 


dr 


for  t  >  0,  where  r(-)  denotes  the  Gamma  function. 


In  what  follows,  we  present  a  new  direct  numerical  method  for  a  general  multi¬ 
dimensional  FOCPs.  The  aim  is  to  present  a  tractable  method  which  can  be  applied 
to  many  FOCPs  of  practical  significance.  The  general  problem  considered  in  the  paper  is 
described  as  follows. 


min 

u£U 

F(u)  =  /  L(t,  x(t),u(t))  dt  +  S(x(T)), 

Jo 

(i.i) 

subject  to 

f  oDfx{t)  =  f{t,x{t),u{t))r  fe(0,T], 

\  x(0)  =  x°, 

(1.2) 

g(u(t ))  <  o,  t  e  (o,  T], 

(1.3) 
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where  x(t)  =  (xi(t),  x2(t), . . . ,  xn(t))T  G  Mn  and  u(t)  =  (ui(t),  u2(t), . . . ,  um(t))T  G  Mm 
are  the  state  and  control  variables  for  some  positive  integers  n  and  m  respectively,  T  >  0 
is  a  fixed  constant,  /  =  (/i,  /2, . . . ,  fn)T,  L,  S  and  g  =  (<71,  <72,  •  ••,  gP)T  are  known  functions 
for  a  positive  integer  p,  x°  G  Mn  is  a  given  vector,  U  C  Mm  is  the  set  all  bounded  piecewise 
continuous  functions  on  [0,T],  and 

o  D?x(t)  =  (0DrXl(t),0Drx2(t),...,0D^xn(t))T 

with  oD?Xi{t)  denoting  Caputo’s  ar th  derivative  of  Xi(t)  defined  in  Definition  1.1.  In 
the  literature,  almost  all  papers  on  FOCPs  only  consider  a  fractional  order  a  G  [0.5,1). 
In  this  paper,  we  consider  FOCPs  for  all  cq  G  (0, 1)  ,  i  =  1, 2, . . . ,  n. 

To  solve  (1.1) — (1.3)  numerically,  we  need  to  first  introduce  an  approximation  scheme 
for  the  system  of  fractional  differential  equations  (1.2).  In  the  open  literature,  there 
are  a  number  of  different  methods  for  solving  the  initial  value  problem  (1.2).  See,  for 
example,  [14,  15,  16,  17,  9,  21,  22,  23,  24,  27,  28,  31].  However,  none  of  these  methods 
have  a  satisfactory  rate  of  convergence  when  a  is  close  to  zero.  In  our  recent  work 
[30],  we  proposed  a  one-step  2nd-order  numerical  integration  scheme  for  solving  a  scalar 
fractional  differential  equation  based  on  a  one-point  quadrature  rule  with  a  judiciously 
chosen  point  in  each  mesh  subinterval.  This  one-step  numerical  integration  scheme  has 
a  2nd-order  rate  of  convergence  which  is  independent  of  a.  It  is  also  easy  to  implement 
and  computationally  inexpensive.  In  this  paper,  we  will  first  extend  this  method  to  the 
system  (1.2). 

The  rest  of  the  paper  is  organized  as  the  follows.  In  Section  2,  we  first  approximate 
the  constrained  problem  (1.1) — (1.3)  by  an  unconstrained  one  using  a  well-known  penalty 
approach.  Then  we  convert  (1.2)  to  an  equivalent  system  of  Volterra  integral  equations. 
In  Section  3,  we  propose  a  discrete  approximation  of  the  objective  function  and  then 
derive  an  explicit  scheme  for  the  Volterra  integral  equations  based  on  a  Taylor  expansion. 
In  Section  4,  we  derive  a  formula  for  calculating  the  gradient  of  the  discretized  objective 
with  respect  to  the  decision  variables.  Finally,  we  propose  a  gradient-based  algorithm  for 
the  problem  on  the  basis  of  this  gradient  formula.  In  Section  5,  numerical  examples  are 
presented  to  demonstrate  the  accuracy  and  effectiveness  of  the  proposed  method.  Section 
6  concludes  the  paper. 

2  Preliminaries 

We  first  make  the  following  assumptions  on  the  given  functions  in  (1.1) — (1.3): 

Al.  /  is  twice  continuously  differentiable  with  respect  to  all  its  arguments. 

A2.  L  is  continuously  differentiable  in  x  and  u. 

A3.  S  and  g  are  continuously  differentiable  with  respect  to  x  and  u  respectively. 

Clearly,  (1. 1)— (1.3)  is  a  constrained  optimal  control  problem.  We  first  approximate 
the  problem  by  the  following  unconstrained  optimal  control  problem  using  a  penalty 
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approach. 


min 

u£U 


subject  to 


p  Tft 

F(u)  :=  F(u)  +  X  ^2  [  [ 9j(u(t))]2+dt 

.7  =  1  7o 

f  o D?x(t)  =  f(t,x(t),u(t)),  te(0,T], 
\  x(0)  =  a;0, 


(2.1) 

(2.2) 


where  [z]+  =  max {0,2;}  and  A  >  1  is  the  penalty  constant.  This  penalty  approach  has 
been  used  extensively  in  optimization  and  conventional  optimal  control  [10,  11,  20,  29, 
33,  35,  38]  and  it  has  been  shown  that  this  penalty  method  is  exact  in  [13,  38]. 

Since  the  penalty  term  in  the  integrand  of  (2.1)  is  smooth,  it  can  be  combined  with 
the  original  objective  integrand  L  to  form  a  new  integrand  which  is  still  continuously 
differentiable  in  x  and  u.  Therefore,  we  may  rewrite  the  penalized  problem  (2.1)-(2.2)  as 
the  following  general  unconstrained  form: 


min 

u£U 


subject  to 


F(u) 

f  o D?x(t)  =  f(t,x(t),u(t)),  te  (0,I*|, 
1  x(0)  —  x°, 


(2.3) 

(2.4) 


where  x{t),u{t),  f ,  L,  5,  x°  and  o Dfx(t)  are  as  defined  before  and  F  now  contains  the 
penalized  constraints. 

Using  Definition  1.1,  one  can  show  the  following  initial  value  problem  is  equivalent  to 
a  Volterra  integral  equation  as  given  in  the  following  lemma. 


Lemma  2.1  Let  f3  G  (0, 1)  be  a  constant  and  <f>(t,y(t))  a  continuous  function.  Then  the 
initial  value  problem 

f  oD?y(t)  =  <t>(t,y(t)),  t  G  (0, T\, 

X  y{ o)  =  yo 

is  equivalent  to  the  following  Volterra  integral  equation: 

y(t)=y0  (t-T)^-V(T,J/(T))dT,  ft  G  (0,  1), 

for  t  >  0,  where  yo  is  a  given  initial  condition. 

PROOF.  The  proof  can  be  found  in  [8]  and  is  therefore  omitted.  □ 


Using  Lemma  2.1,  we  rewrite  (2.3)-(2.4)  in  the  following  optimal  control  problem: 

min  F(u)  (2.5) 

uCU 

1  f* 

subject  to  Xi(t)  =  x°  +  — — -  /  (t-  r)Qi_1/j(r,  x(t),  u{t ))  dr,  (2.6) 

J-  ( ai)  JO 

t  G  (0,  T],  i  —  1,  2,  ...,  n. 
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3  Discretization  of  (2.5)  and  (2.6) 

In  this  section,  we  propose  an  algorithm  for  the  numerical  solution  of  (2.5)-(2.6). 

Let  N  be  a  given  positive  integer.  We  divide  [0,  T]  into  a  uniform  mesh  with  the  mesh 
points  tj  =  jh  for  j  —  0, 1, . . . ,  N,  where  h  =  T /N.  Using  this  partition,  we  approximate 
the  objective  F  in  (2.5)  with  the  trapezoidal  rule  as  follows. 


F(u)  ^L(t0,x(to),u(t0))h  +  ^2  L(tj,x(tj),u(tj))h 

3= 1 


N—\ 


+\l(tN ,  x(tN),u(tN))h  +  S(x(tN)). 


(3.1) 


By  (2.6),  for  each  i  =  1,  2, ...,  n  and  j  —  1,  2, . . . ,  N,  we  have 


Xiitj)  =  x°  + 


=  xr  + 


r(«i)  Jo 
i  ph 


(tj-r)ai  fi(T,x(T),u(r))dT 


r(a»)  Jo 

i  j  pkh 


(. jh-r)ai  fi(T,x(T),u(T))dT 


=  Xi 


+  rTw5Z/  (jh~r)ai  1fi(r,x(r),u{T))dT.  (3.2) 

r(«i)  ~J(k-i)h 


We  now  consider  an  approximation  for  the  integral  on  the  right  hand  side  of  (3.2). 
By  Assumption  Al,  fi(t,x(t),u(t ))  is  twice  continuously  differentiable  with  respect  to  t, 
x  and  u.  Thus,  for  k  =  1,2  we  use  Taylor’s  theorem  for  /j(r,  x(t),  u[t))  at  any 

point  Tjk  G  ((k  —  l)h,  kh )  to  yield 


Mf  x(t),u(t))  =  fi{Tljk ,  x(T]k),u(Fjk))  +  K]k{r  -  rjfc)  +  c)k{r  -  r}k)  ,  (3.3) 

where  is  the  coefficient  of  the  reminder  of  the  expansion  and 


ic  FA 

jk  dr 


(.Tjk’xFjk)Mrxjk)) 


^  dxi 

i=i  1 


(Tji kM^k),upjk 


dxi 

))  8 T 


171  ry  r 

V  — 

dur 

r=l 


dui 

Fk,xFk),upk))  d T 


Fjk) 


Therefore,  replacing  /j(r,  x(r),  u(r))  in  the  integrand  of  the  last  term  in  (3.2)  with  the 
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RHS  of  (3.3),  we  have,  for  any  k  —  1, ...,  j, 


fkh 


r(ati) 


'  (k—l)h 


(. jh-T)ai  fi(T,x(T),u(T))dT 


fkh 


r(^)  J(k-l)h 

1 


(. jh  -  r)ai  1  [fi(r]k,  x{Tljk),u{r]k))  +  K)k{r  -  r)k  )]dr  +  R) 


r(ati 


fi 


jk")  u'\l  jk)  ?  jk) )  '  lvjk\‘  1  jk. 

{jh  —  (k  —  l)/r)Qi  {jh  —  kh)' 


jk 


OLi 


OLi 


Kjk  fkh 


{jh  -  r)Qi  (r  -  tL)  dr  +  R) 


jk 


r(«i)  J(k~\)h 

zY{a-  +  i^Tl^xiT}k)^u{Tljk))[{j  -k  +  l)ai  -  {j  -  k)c 


K)k  fkh 


r(«»)  J(k  -1  )h 

nkh 


(jh  -  r)"‘  (r  -  T'ik)dr  +  R‘k, 


(3.4) 


where  Bljk  = 


{jh  -  r)Qi  c)k{r  -  Tljk)  dr. 


■^(^)  J (k—l)h 

We  now  consider  the  choice  of  rjfc.  From  (3.4)  it  is  clear  that  rjfc  should  be  chosen 
such  that  the  second  term  becomes  zero  so  that  the  truncation  error  in  (3.4)  is  This 
choice  of  rjfc  is  given  in  the  following  theorem. 

Theorem  3.1  For  any  given  j  G  {1,  2, ... ,  N}  and  k  e  {1,  2  . . . ,  j},  the  unique  solution 
to 


fkh 


f(k-l)h 


{jh~T)^-\T-T]k)dT  =  0 


is  given  by 


TJk  =  h 


[{j  —  k  +  l)Qi+1  —  (j  —  k)ai+1]  +  {oii  +  l)[(j  —  k  +  1  )ai{k  —  1)  —  {j  —  k)aik\ 


(a*  +  1)  [(j  -k  +  l)ai  -  {j  -  k)ai] 

Furthermore,  (k  —  1  )h  <  rjfc  <  kh. 

PROOF.  See  the  proof  of  Theorem  2.1  in  [30]. 


(3.5) 


□ 


Substituting  the  expression  for  in  (3.5)  into  (3.4)  and  combining  the  resulting 
expression  with  (3.2),  we  have  the  following  representation  for  Xi{tj). 


Xi(tj )  =x°i  + 


h° 


r(«i  + 1) 


£  fi(Tjk’x(Tjk)>u(Tjk))[U  ~k+  l)ai  -  U  -  k)ai l  +  R)y  (3-6) 


fc=i 


for  j  =  1,  2, . . . ,  N,  where  rjfc  is  given  in  (3.5)  for  k  =  1,2, ...  ,j  and  Bj  =  Yjk=i  Rjk- 
Omitting  the  remainder  Bj  in  (3.6),  we  have  an  equation  approximating  (3.2)  which  has 
the  truncation  error  Bj.  An  upper  bound  for  Bj  is  given  in  the  following  theorem. 
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Theorem  3.2  Let  Assumption  A1  be  fulfilled.  Then  the  following  estimate  holds: 

\R)\  <  Ch2, 

where  C  denotes  a  positive  constant  independent  of  h. 

PROOF.  See  the  proof  of  Theorem  2.2  in  [30].  □ 


From  (3.6)  it  is  clear  that  to  compute  xfitfi,  we  need  to  calculate  ffirfi,  xififi,  uififi), 
where  x(rljk)  =  (x i {t‘ k).  xfirfi. .xfirfi.)).  u(rljk )  =  (ufir^),  u2(rljk), . . . ,  um(Tljk))  and 
k  =  1,2,  However,  xififi^ufirfi)  are  not  available  directly  from  the  scheme,  al¬ 

though  the  points  r'-k  for  feasible  ifi  and  k  are  known.  Thus,  approximations  for  xfirfi) 
and  u(rljk )  need  to  be  determined.  Next,  we  propose  a  numerical  scheme  based  on  a  linear 
interpolation  and  a  Taylor  expansion  for  approximating  x(rjfc)  and  u(rjfc). 

For  any  indices  j  and  k  satisfying  1  <  k  <  j  <  N,  since  rfi  G  {tk-i,tk)  by  Theorem 
3.1,  we  use  the  following  linear  interpolation  to  approximate  xfirfi)  and  ur(rfi.): 


where 


xifik)  ~  x{tk-fi  +  p)k{x{tk )  -  x(tk-fi ), 

(3.7) 

u{Tjk)  ~  U(tk-fi  +  p)k(u(tk)  ~  U(tk-fi), 

(3.8) 

P'jt  ■=  T>"  ~fi~'  £  (0, 1). 

(3.9) 

The  truncation  error  in  the  above  linear  interpolation  is  of  order  0{h2).  Using  (3.7)  and 
(3.8),  we  approximate  /j(rjfc, x(Tjk),u(Tjk))  as  follows. 


fi(TtjkXrtjk),u(Tjk)) 

~  fi  xitk.fi  +  pljk{x(tk)  -  x(tk_fi),  u(tk-fi  +  fijfiuikfi  -  u(tk-fi ))  (3.10) 

The  truncation  error  for  the  above  approximation  is  also  of  order  0(h2). 

Replacing  /j(r*fc, x(rjfc), u(rjfe))  in  (3.6)  with  the  RHS  of  (3.10),  we  have,  up  to  some 
terms  of  order  0{h2),  the  following  scheme  for  (2.6): 


xfitfi  =  x°t+hai  [fi  (rjfc^(4~i)  +  Pjk{x{tk)  ~  x(tk-fi),u(tk-fi  +  Pjk(u(tk)  ~  u(tk- 1))) 


k= 1 


.((j-k+ir-(j-kr) 


(3.11) 


for  j  =  1,  2, . . . ,  N,  where  ha%  =  and  rjfc  is  defined  by  (3.5).  Clearly,  (3.11)  defines  a 

time-stepping  scheme  for  (2.6)  with  a  truncation  error  of  order  0{h2)  because  of  Theorem 
3.2  and  the  truncation  error  in  (3.10). 

The  above  scheme  is  implicit  as  it  constitutes  a  nonlinear  system  in  x(tfi  =  [x\ (tfi,  ■  ■  ■  ,xn(tj))T. 
An  iterative  method  such  as  a  Newton’s  method  can  be  used  for  solving  (3.11).  However, 
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it  is  also  possible  to  define  an  explicit  single  step  scheme  by  further  approximating  the 
7 th  term  in  the  sum  in  (3.11)  by  the  following  Taylor  expansion: 

fi(TjjAtj- 1)  +  p'ifoih)  -  w(V-i)  +  ~  u^j- 1))) 

=  1)  +  Pjj(u(tj)  -  u(tj- 1))) 

dfi 


('rh’:E(b'-i)>u(b-i)+/>b(u(b)-u(b-l))) 


(pjjfaitj)  -  xl(tj- 1))) 


(3.12) 


^  dxi 
i=i  1 

+  0{h2). 

Thus,  combining  (3.12)  and  (3.11)  yields 
j~l  r 

Xi(tj)  =x°i  +  hai  Y  [fi  (Tjk,x(tk- i)  +  PjMtk)  -  x(tk-1)),u(t/c-i)  +  p)k{u{tk)  -  u(tk- 1))) 

k= 1 

((j  -  k  +  l)Qi  -  (j  -  k)ai)  +  ha.fi  (rh,a;(ij_i),w(ij_i)  +  p)j{u{tj)  -  u(ij_i))) 
n  rdf-  -I 

+  /l«i5Z  +^2)- 

(3.13) 


j=i 


Let  ad  :=  (xi(ij)),x2(ij),. . .  ,xn(ij))T  and  td  :=  (ui  (£,■), w2(tj),  •  •  •  ,Wm(t,))T  for  j  = 
0, 1,  ...,1V  with  the  given  initial  condition  a;0.  Omitting  the  truncation  error  terms  of 
order  0(h2)  and  re-organising  (3.13),  we  have  the  following  linear  system  for  ad. 

Bj(xj~1,uj~1,uj)xj  =  Cj(x°,  x1,  x2,  ...,xj~1,u°,u1,  ...,td),  j  —  1,  2, . . . ,  N.  (3.14) 

where  Ld  is  the  n  x  n  matrix  given  by 

(l  -  &i'-i  -bio  . . .  -b{n  \ 


Bj  = 


Jn  ui2 

-6j2i  1  -  bi 


22 


-bi 


2  n 


(3.15) 


\  ~Kl  ~Ka  ■■■  1  -  KnJ 


with 


U  i  ,  dfi 


(Tjj, xj  buJ'  1+pjj(ui -U,j  !)) 


(3.16) 


for  i  =  1,  2, . . . ,  n,  l  =  1,  2, . . . ,  n  and  0  =  (c{,  c^, . . . ,  c^)T  with 

l-1  r 

4  -*? + k  Y.  y  (i^**-‘+4(**  -  + &<«*  -  «*-■)) 


fc=i 


((j  -  k  +  l)a<  -  (j  -  /c)Qi))  +  KJi  (Tjj,  XJ  \uJ  1  +  pL(rd  -  *)) 

n 

-ET‘4-  (3.17) 


i=l 


It  is  clear  that  to  calculate  ad,  we  need  to  solve  the  system  of  equations  (3.14)-(3.17). 
Next  we  show  that  (3.14)-(3.17)  is  uniquely  solvable  when  h  is  sufficiently  small. 
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Theorem  3.3  The  system  (3.  If)- (3. 17)  has  a  unique  solution  when  h  is  sufficiently 
small. 


PROOF.  We  will  first  show  that  for  j  =  1,  2, . . . ,  N,  B J  is  a  strictly  diagonally  dominant 
matrix,  i.e., 

n 

1  -Hi  >  1^1’  i  = 

1=1, l^i 

Since 

i-4>i-l4l, 

we  only  need  to  show  that 

n 

1-I4I>  E  141, 

1=1, 

or  equivalently, 

n 

Vi. 

i=i 

Note  that  pG  G  (0, 1)  by  (3.9)  and  h0i  >  0.  We  have,  from  (3.16), 


141  <  ft 


Of, 

qx  I  Mtj - 1 ) - 1 )+p)j  ( u(tj ) -u(tj _ i ))) 


Since  /,;  is  twice  differentiable  in  x  on  [0, 1],  is  bounded  on  [0, 1].  Let 


M 


max 

l<i<n 
1  <l<n 


dfi 

dxi 


We  have 


Choose  hi 


/  r(«j+i) 
V  nM 


141  <  KM  =  .,M. 

f  (a*  +  i ) 

“*  and  h  =  mini<j<n{hj}.  When  h  <  h,  we  have 


Vi. 


Thus,  BJ  is  a  strictly  diagonally  dominant  matrix  for  all  j.  By  the  well-known  Levy- 
Desplanques  theorem,  we  conclude  that  LP  is  a  non-singular  matrix  and  therefore  the 
system  (3.14)-(3.17)  has  a  unique  solution.  □ 


Note  that,  for  a  given  initial  condition  a:0,  (3.14)  provides  a  one-step  explicit  scheme 
for  approximating  the  solution  to  (2.6).  Introduce  X7  =  (x\,  xJ2, ...,  a;^)T  and  I/J  = 
for  j  =  0,1,...,  IV,  and  let  X  =  (X°,  X1, ...,  XN)  G  Mnx(iV+0  and  U  = 
(U°,  U1, ...,  C/^)  G  ]Rnx(JV+1).  Using  the  approximation  schemes  defined  in  (3.1)  and  (3.14), 
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we  pose  the  following  finite-dimensional  optimal  control  problem  approximating  (2.5)- 

(2.6): 


min 

f/g]RmX(JV+l) 


subject  to 


Fh{X,  U )  :=  ^  (L(t0,  A0,  U°)  +  L(tN ,  XN ,  UN ))  h 

N-l 

+  Uj)h  +  5'(A"Ar),  (3.18) 

3= 1 

f  ^(A^1,  CP"1,  EP)AJ  =  (A (A0,  A1, ...,  A-j-1,  f/°,  C/\  ...,  [/•?), 

<  J  —  1;  2, ...,  A,  (3.19) 

A0  =  a;0. 


A  solution  (X,U)  to  (3.18)-(3.19)  is  an  approximation  to  a  solution  (x(t),u(t))  of  (2.5)- 
(2.6)  at  the  mesh  nodes  tj,j  =  0, 1, ...,  N. 


4  Solution  strategy 

In  this  section,  we  first  determine  the  gradient  of  the  objective  (3.18)  with  respect  to 
all  U .  We  then  develop  an  algorithm  for  finding  approximate  solutions  to  the  problem 

(3.18) -(3.19). 

From  the  definition  of  U  we  see  that  the  lst-order  optimality  conditions  for  (3.18)  and 

(3.19)  are 

dF 

-|  =  0,  r  —  1,2, ,  m,  j  —  0,1, . . . ,  N.  (4.1) 

Ulij- 

We  now  determine  the  LHS  of  (4.1).  From  (3.18)  and  (3.14)-(3.17),  we  have 

dFh  _1  dL(t0,X°,U°)  ^  dL(tp,  Xp,  UP)  dx p 

du°r  2  ?  du°r  ^  Qxp  du°r 

1  ^dL(tN,XN,UN))dx?  ^dS(XN)dx? 

2  QXN  Qu0  2-^/  QXN  Qu 0  ’ 

1=1  1  1  1=1  L  1 

dFh  dLjtj,  X*,U*)  ^-4^dL(tp,XP,UP)dxpi 

dui  dui  ■“  ~  dxp  dui 

1  ^  dL(tN,XN,UN)dx?  dS( XN)  dx f 
+  2  ^  dx f  dui  +  “  dui 


for  j  =  1, . . . ,  A  and  r  =  1,  2, . . . ,  m. 

Qx'j P 

From  (3.19)  we  see  that  —A  =  0  when  p  <  j. 


Thus,  the  above  expression  can  be 
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rewritten  as 


d  Fh  1 ,  d  L 


+»EE 


du9.  2  dur  I  (t0,x°,u°)  ^  rr*  dxi  I  ( tp,xp,up )  du9. 

p=  1  Z=1 


dxf  22  <92  drcf 

— —  +  >  —  — — . 

i  t  ^  r),7'  •  (vN\ 


2  ^  dxi  du °  “  <9x;  l(x^)  <9^°  ’ 

1=1  1=1 


d Fh  ,  dL 


'‘EE 


<9w£  dur  I ^  z--'  dxi  I ( tp,xp,up )  dui 

P — ^ — -L 


iw* 

9  2^  /It-. 


2  ^  <9Zj  l(tw,X'v,[/Af)  Qu3 


dxf  22  <92  <9a;f  .  Ar 

— V  +  \  — U  ?  =  12...  N. 

a,, 7  2^  (x'V)  <9?2  ’ 


We  now  need  to  determine  Xf  for  j  —  0, 1, 2, . . . ,  N.  By  (3.14),  we  have 

BPXP  =  cp,  p  =  1,  2, . . . ,  N. 

Taking  the  derivative  w.r.t.  uJr  on  both  sides  of  the  above  equation  gives 

dJ^Xp  +  Bp—  =  —. 
dzir  dui  dui 

Rearranging  the  above  equation,  we  have 

RPdXP  =  9CP_  8B^xp 


dXp 

dCp 

dui 

dui 

0  .  dx\  dx p 

dui  ’  dui  ’ 

_  fdci 

dcl2 

^  dui  ’  dui  ’ 

db12 

dv?r 

dul* 

db22 

dv?r 

dv?r 

9K.2 

\  dui 

dv?r 

Using  (3.16)  and  (3.17)  we  see  that  the  entries  of  the  above  matrices  are  given  by 

&K  ih  d(&n\  ^^1,u*-1+r„(u*-up-i))) 

d^r  =PpP  “*  <%2  ’ 


(d(Mr;k,  xk_1  +  p;k(xk  -  xk~l),  u ^  +  ppk(uk  -  uk~ *) 

'  ai  \  dui 

■  ({p-k+  l)ai  -  (/;-/.•)“'))) 

dft  (r;p,xp-\u^  +  p;p(up~u^))  _  22  .  ,  a&S 

dui  dui  il  1  dv  ~ 


.P-1  wuil 

1  duJ ' 
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for  i  —  1,  2, . . . ,  n,  l  —  1,  2, . . . ,  n. 

To  calculate  and  in  (4.5)-(4.6)  for  p  =  1,2, ...  ,N  and  j  =  0, 1, IV,  we  use 
the  following  algorithm: 

Algorithm  A 


1.  When  j  =  0,  if  p  —  1,  then 


db‘  =  AlK  ^ 


;  dxidur 


(r^.xo^+pi^m-u0)) 


(1  Pii)j 


du ° 

If  p  >  1, then 


o  ^ 


^Ci  _  i  dfi  |  ri  j  n,  \  ^ 

-  'W  o^-|(r1ii,x0,f/0+pii(c/1-r/°)li  -  Pn)  -  qTo>- 

r  1=1  r 


dK 

du® 


=P 


'pphaa  ^  ( 


q= i 


d2h 

dxidx„  nTpp 


\(TiXr-\UP-i+pi(UP-Ur-l)) 


dxpq  ^ 

<9w2 


&  fi 

du6  ^haidu^Tlv^x°+pU  {x1-x°),u°+pU(u1-u°)(1  -  dpi)(pQi  -  (p  -  !)“') 


f  p  1  -  n  /  ()  J~ 1 

+haA  ^2  ^2  (^rl h;fcAfc-i+p;fc(^fc-^fc-i)Tfc-i+4fc(^-^-i))(1  -  d^.) 

fc=2  f  q=l 


dx1! 


-1 


<9«2 


dfi 


dx1! 


+^2Kk’xk~1+pU^xk-xk~1^uk~1+pU^uk~uk~1^Ppk^J 
((p  —  k  +  l)ai  -  ip  -  A;)ai) 

n 

-E(^i 5+*f 


<9aZ 


,7  /  |  ~~g 


-l 


„=1 


<9w2 


n  ^ry  1(%fA 

J=1  ,  <9<  1  du°rJ ' 


2. 


For  j  =  1,  2, . ,  A,  if  p  <  j ,  then 

If  P  —  j,  then 


0. 


dK 

dul 

M 

ch4 


p*iA 


d2fi 


33  ‘  dxidur  (Tpp 


(tL  ,X0-\W-^+p\  AUi-W-i))Pjj, 
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If  P  —  3  +  1)  then 


dK 

dul 


diet 


=  P'h, 


d2h 


pp-"<*i  Qx  q  l(rip,XJ>-1>[/P-1+pj,J)([/P-^-1))(1  -  Ppp) 

I  f  h  ( X'  I  ^  1 

+  Pppfta;  I  2^  ^^rl(r'p,XP-1,C/P-1+pip(C/P-C/P-1)) 


9=1 


dxidx„  y  pp v 


(9«r 


=  h, 


&i 


dfi 


d u~  ’K'pj 

dfi 


I  (r*, ,-Yl - 1  +p*  . (XI -XI - 1  ),C/1- 1  +#»* , (C/1  - CP'-1 ) Ppj 


(Of*  I  P  dxJ(l  1  A 

2^  !,^HrY,Xl-1+pX(Xl-Xl-i),C/l-i+pX(C/l-C/l-i))Ppj-^r>) 
9=1  q  r 


(2ai  -  1) 


+  h0 


dfi 


8Ur  KpP 


l(r^,XP-i,C/P-1+pj,p(C/P-C/P-1))(l  -  Ppp) 


(9«r 


/  dfi , 

Va^l(T^’XP"1,l/P“1+pi'p(C/P-l/P“1)) 


-1 


duri 


If  p  >  j  +  1,  then 


0&S 

duJr 

del 

dul 


-i 


dut 


i  h  ^  fi  I  ^9 

Pppn<*>  ar,ar  l(Y,xp-i,c/p-i+Pjp(c/p-c/p-i))- 

9=1  9 


k=j  L 


dut 


(  dfi  I  ft  *  N9x9 

5=1  \  9 

1^1  x  93*' 

+  dx~J  (Y’Xfe-1+pZ(xfc-xfc-1),c/fc-1+p;fc(c/'=-c/'=-1))Ppfc; 


dWr 


(p  —  k  +  l)ai  —  (p  —  /c) 


\«») 


<9aZ 


h  I  (dfi  |  —9 

Z^l^  l(^p.^p-Mip-1+pj,p(c/p-c/p-i) 


—l. 


9=1 


<9wP 


Ef^V*rlM 

i=i 


dul 


duJ  )  ’ 


where 


Ppfc>  k  =  j , 

VKPpfc)  —  ^  1  “  Ppfc;  k  —  j  +  1, 
0,  k  >  j  +  1 . 
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Using  Algorithm  A,  we  propose  the  following  gradient-based  search  algorithm  for 
solving  (3.18)— (3.19): 

Algorithm  B 

1.  For  a  given  positive  integer  N ,  let  tj  =  jh  for  j  =  0,1, N ,  where  h  =  T /N. 

2.  Set  k  —  0.  Choose  a  tolerance  £  >  0  and  an  initial  value  £/(0)  =  ( U° ,  U1, ...,  UN G 
|"*x(iV+t 


3.  Calculate  X™  =  (X°,  X1, ...,  XN)W  G  Mrix(iV+1)  using  (3.19). 

4.  Use  Algorithm  A  and  X^k\[/^  obtained  to  calculate  (^j)  and 


p  =  1,2, ...  ,N ,  j  =  0,1,2, ...  ,N ,  and  r  —  1,2, ... ,  m. 


\  dv?r  ) 


(k) 

AA)  for 


5.  Solve  (4.4)  for 


dXp ^ ik)  ^ dxp  dxp  dxpn)\ 
dui  J 


W  T 


dul  ’  dul  ’  ’  du3r  / 

9F,  DFh  dFh\W t 


6.  Compute  XFh(X(k),  U(k) )  =  . . . ,  7^)  using  (4.2)  and  (4.3),  where 

=  (if .  fh  -  ■ ,  f|)T  f0O  =  .  JV.  If  HVJipfW, r/<‘>)||  <  £,  goto  Step 

8.  Otherwise,  continue. 

7.  Compute  (Adfc+1\  f/^"1"1^)  using  the  backtracking  line  search  method  as  follows: 


7a.  Choose  <r°  >  0,  f3  G  (0, 1)  and  7  G  (0, 1).  Let  l  =  1. 

7b.  Update  a1  =  /3crz— 1 . 

7c.  Let  U  =  U ^  —  alX7 Fh(X(k\  U (k^)  and  compute  X  using  (3.19)  and  U . 

7d.  If  Fh(X,  U)  <  Ffe(A(fc),t/(fc))-7^||VFft(AW,t/W)||2,sett/(fc+i)  =  U,X^  = 
X  and  k  —  k  +  1,  and  goto  Step  4.  Otherwise,  let  l  =  /  +  1  and  goto  Step  7b. 

8.  Let  (X*,U*)  =  (JW,[/^)  and  evaluate  Fh(X*,U*)  using  (3.18). 


Remark:  When  using  Algorithm  B,  we  need  to  solve  the  two  systems  (3.14)  and  (4.4). 
Note  that  the  square  coefficient  matrices  in  these  two  systems  are  the  same.  Thus,  we  only 
need  to  calculate  one  matrix  inverse.  It  is  also  worth  pointing  out  that  when  implementing 
Algorithm  A  we  can  calculate  (yr,  777)  for  tj,j  =  1,  2, . . . ,  N  in  parallel.  This  can  save 
a  considerable  amount  of  CPU  time  when  N  is  large  . 


5  Numerical  Results 

In  this  section,  we  will  use  Algorithm  B  to  solve  several  non-trivial  examples.  In  our 
numerical  experiments,  we  choose  f3  =  0.6,  7  =  0.05  and  <to  =  10  in  Algorithm  B. 
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Example  1.  Consider  the  following  FOCP  with  two  state  variables 


min  F(u )  =  -  f  ( x\(t )  +  x\(t)  +  u2(t))dt, 

2  Jo 

(  oD^x^t)  =  -xi (t)  +  x2(t)  +  u(t), 
subject  to  <  oD^2x2(t)  =  —2x2(t), 

(  xi(0)  =  x2(0)  =  1. 

The  exact  solution  for  =  a2  =  1  is 

O 

x,  (t)  =  - -e~2‘  +  2.48164e“'/5‘  +  0.018352e'/a, 
x2(i)  =  e~2‘, 

u(t)  =  F2t  -  1.02793e~'/5‘  +  0.0443056e'/5t. 


(5.1) 


(5.2) 

(5.3) 

(5.4) 


This  test  problem  is  taken  from  [7].  Substituting  (5.2),  (5.3)  and  (5.4)  into  (5.1),  we  find 
the  exact  cost  F  is 

F  =  0.431984 

We  first  solve  the  problem  for  various  choices  of  N  when  a\  —  a2  —  1.  The  optimal 
values  of  F  are  listed  in  Table  (5.1).  It  is  clear  from  Table  5.1  that  the  computed  optimal 


Table  5.1:  Optimal  cost  F  for  different  choices  of  N  with  =  a2  =  1  for  Example  1. 


N 

20 

40 

80 

160 

320 

F 

0.432743 

0.432180 

0.432036 

0.431999 

0.431990 

cost  approaches  the  theoretical  one  as  N  becomes  larger  and  all  the  computed  costs 
over-estimate  the  exact  one. 

We  then  solve  Example  1  for  various  choices  of  a±  and  a2  when  N  =  200.  The  results 
are  shown  in  Table  5.2  and  Figures  5. 1-5. 3.  From  Table  5.2  we  see  that  the  total  cost 
decreases  as  the  values  of  ou  and  a2  decrease. 

Table  5.2:  Optimal  value  of  F  for  different  choices  of  cki,ck2  for  Example  1;  N  —  200 


a 

(X  i  —  (X  2  —  1 

cti  =  0.9,  <x2  =  0.5 

cti  =  0.2,  <x2  =  0.3 

F 

0.431995 

0.347784 

0.267591 

Example  2.  Consider  the  following  FOCP  with  bound  constraints  on  the  control. 

min  F(u )  =  ^  f  ( x\(t )  +  x\(t)  +  u2(t))  dt, 

2  Jo 

(  o D^x^t)  =  -xi (t)  +  x2(t)  +  u(t), 
subject  to  <  oD^2x2{t)  =  —2 x2(t), 

{  £i(0)  =  x2(0)  =  1, 

-0.2  <  u(t)  <  0. 
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Optimal  u  for  Example  1  with  a^a^l 


0  0.2  0.4  0.6  0.8  1 


t 


Figure  5.1:  Optimal  values  of  Xi,x2  and  u  for  Example  1  with  a±  =  a2  =  1 


0 

-0.05 

-0.1 

-0.15 

-0.2 

-0.25 

-0.3 

-0.35 

-0.4 

-0.45 

-0.5 


Optimal  u  for  Example  1  witha^O.9  and  a2=0.5 


Figure  5.2:  Optimal  values  of  x±:x2  and  u  for  Example  1  with  au  =  0.9,  a2  =  0.5 


Figure  5.3:  Optimal  values  of  X\,x2  and  u  for  Example  1  with  ai  =  0.2,  a2  =  0.3 


This  example  is  the  same  as  Example  1  with  lower  and  upper  bounds  on  the  control  u. 
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To  solve  this  problem,  we  first  transform  the  FOCP  to  the  following  form: 

min  F(u)  —  -  f  (, x\(t )  +  x\{t)  +  w2(t))  dt  +  A  f  ((w2(t)+  +  ( —u(t )  —  0.2)+)  dt, 

2  Jo  Jo 

subject  to  the  system  of  dynamic  constraints 

oDt'xiit)  =  -xi  (t)  +  x2{t)  +  u(t), 

0D?2x2(t)  =  -2  x2(t), 
ici(0)  =  £2(0)  =  1. 

We  choose  A  =  10,  000.  The  computed  optimal  values  of  F  corresponding  to  different 
values  of  a  are  shown  in  Table  5.3  in  which  we  also  list  the  computed  optimal  costs  of 
Example  1  for  comparison.  From  the  table  we  see  that  computed  optimal  costs  for  the 
constrained  problem  are  slightly  bigger  than  the  corresponding  optimal  costs  of  the  uncon¬ 
strained  problem  which  is  reasonable.  To  further  see  the  difference  between  results  from 
the  unconstrained  and  constrained  problems,  we  plot  the  optimal  values  of  X\ (t),x2{t) 
and  u{t)  for  the  different  values  of  a  in  Figures  5. 5-5. 6.  From  the  figures  we  see  that  u 
satisfies  the  constraints. 

Table  5.3:  Optimal  value  of  F  from  different  choice  of  a±,  a2  for  Example  1  and  2;  N  =  200 


- 

CXi  —  Q?2  —  1 

=  0.9,  a2  =  0.5 

=  0.2,  a2  =  0.3 

Example  1 
Example  2 

0.431995 

0.442711 

0.347784 

0.356319 

0.267591 

0.272743 

Optimal  u  for  Example  2  with«1=a2=1 

-0.02  -  /  - 
-0.04  - 
-0.06  - 
-0.08  - 
-0.1  - 
-0.12  - 
-0.14  - 
-0.16  - 
-0.18  - 

-0.2  . 1 . x - 1 - 1 - 1 -  t 

0  0.2  0.4  0.6  0.8  1 


Figure  5.4:  Optimal  values  of  Xi,x2  and  u  for  Example  2  with  a±  =  a2  =  1 


Example  3.  Consider  the  following  FOCP  containing  two  states  and  two  controls. 


min  F(u)  =  -  /  (xl(t)  +  x22(t)  +  u\(t)  +  u\(t))  dt , 
2  Jo 

(  o D^xxit)  =  -xi (t)  +  x2{t)  +  ui(t), 
subject  to  <  o Dt2x2(t)  =  —2 x2(t)  +u2(t), 

[  a;i(0)  =  x2(0)  =  1. 
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Figure  5.5:  Optimal  values  of  xi,x2  and 


Optimal  u  for  Example  2  wither.,  =0.9  and  q2=0.5 

-0.02  - 
-0.04  - 
-0.06  - 
-0.08  - 
-0.1  - 
-0.12  - 
-0.14  - 
-0.16  - 
-0.18  - 

-0.2 - 1 - — - 1 - 1 - 1 -  t 

0  0.2  0.4  0.6  0.8  1 

u  for  Example  2  with  an  =  0.9,  a2  =  0.5 


Figure  5.6:  Optimal  values  of  x±,  x2  and  u  for  Example  2  with  an  =  0.2,  a2  =  0.3 

This  problem  is  an  extension  of  Example  1  and  reduces  to  Example  1  when  u2  =  0.  The 
optimal  values  of  F  corresponding  to  various  values  of  a  are  listed  in  Table  5.4  in  which 
we  also  list  the  optimal  costs  from  Example  1  for  comparison.  From  the  table  we  see  that 
optimal  costs  from  Example  3  are  smaller  than  the  corresponding  ones  from  Example  1. 
This  is  expected  as  Example  1  is  a  special  case  of  Example  3.  We  also  plot  some  of  the 
optimal  states  and  controls  in  Figures  5. 7-5. 9. 


Table  5.4:  Optimal  value  of  F  at  different  choice  of  an,a2  for  Example  1  and  3  with 
N  =  200  _ 


- 

CXi  —  Q?2  —  1 

an  =  0.9,  a2  =  0.5 

«i  =  0.2,  a2  =  0.3 

Example  1 
Example  3 

0.431995 

0.417228 

0.347784 

0.332577 

0.267591 

0.259506 
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Optimal  x1=x2  for  Example  3  with  a1=a2=1 


Optimal  u  1  for  Example  3  with  a1=a2=1  Optimal  u2  for  Example  3  with  a1=“2=1 


t 


Figure  5.7:  Optimal  values  of  aq,  x2  and  Ui,u2  for  Example  3  with  aq  =  a2  =  1 


Example  4. 

rlO 

min  F(u)  =  /  [x\(t) x\(t)  +  u2 (t))  dt x\( lOj+x^lO), 

Jo 

(  oD^x^t)  =  x2(t), 

subject  to  <  qD'J2 x2 (f )  =  0.2 x2{t)  —  x\ (t)  —  0. lccf  (t)  +  u(t), 

(xi(0)  =  2,  x2(0)  =  0. 

When  a  =  (1, 1),  the  dynamical  system  in  this  example  is  the  Duffing  equation  which  is 
known  to  display  chaotic  behaviour  without  any  controls.  We  first  solve  the  problem  for 
various  choices  of  N  when  ai  =  a2  =  1  to  demonstrate  that  our  method  converges.  The 
computed  optimal  values  of  F  are  listed  in  Table  5.5.  As  can  be  seen  from  the  table,  the 
numerical  solution  improves  as  N  increases.  The  computed  x\,x2  and  u  when  N  =  1000 
and  ct\  —  a2  —  1  are  shown  in  Figure  5.10. 
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Optimal  u1  for  Example  3  with  a1=0.9  and  a2= 0.5  Optimal  u2  for  Example  3  with  <^=0.9  and  «2=0.5 


Figure  5.8:  Optimal  values  of  x1}  x2  and  Ui,u2  for  Example  3  with  an  =  0.9,  a2  =  0.5 


N 

300 

600 

800 

1000 

1200 

F 

10.5906 

9.3284 

9.2114 

9.1659 

9.1464 

Table  5.5:  Optimal  costs  of  Example  4  when  an  =  ct2  =  1. 


We  now  solve  Example  4  when  an  =  0.7,  a2  =  0.8  using  the  uniform  mesh  with 
N  =  1200.  The  computed  optimal  value  of  F  is  6.9390  and  the  computed  optimal  xi,x2 
and  u  are  shown  in  Figures  5.11.  From  this  example  we  see  that  the  fractional  optimal 
control  can  achieve  a  better  optimal  solution  than  its  integer  counterpart.  In  fact,  this 
phenomenon  is  true  for  all  of  our  examples  in  this  section. 
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Optimal  xl  and  x2  for  Example  3  with  a1=0.2  and  a2=0.3 


optimal  u1  for  Example  3  with  a1=0.2  and  «2=0.3  Optimal  u2  for  Example  3  with  a ^=0.2  and  «2=0.3 


Figure  5.9:  Optimal  aq,  x2  and  ui,u2  for  Example  3  with  aq  =  0.2 ,a2  =  0.3 

6  Conclusion 

In  this  paper,  we  presented  a  numerical  method  for  solving  nonlinear  fractional  optimal 
control  problems  with  multiple  states  and  controls.  We  first  devised  a  novel  2nd-order 
numerical  integration  technique  for  the  fractional  dynamical  system  using  a  set  of  ju¬ 
diciously  chosen  quadrature  points.  Based  on  this  numerical  integration  technique,  we 
then  proposed  a  scheme  for  the  discretization  of  the  continuous  fractional  optimal  con¬ 
trol  problem.  A  formula  for  calculating  the  gradient  of  the  discretized  cost  function  with 
respect  to  the  decision  variables  has  been  derived  and  a  gradient-based  algorithm  has 
been  proposed  for  finding  an  optimal  solution  to  the  discretized  optimal  control  problem. 
Numerical  experiments  on  several  non-trivial  fractional  optimal  control  problems  have 
been  conducted  and  numerical  results  from  these  experiments  show  that  our  method  is 
accurate  and  robust  with  respect  to  the  orders  of  the  fractional  systems.  The  numerical 
results  also  show  that  the  method  is  able  to  solve  real-world  fractional  optimal  control 
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Optimal  value  of  x  1  for  example  4  at  a1=a2=1 


Optimal  value  of  x  2  for  Example  4  at  a1=a2=1 


Optimal  value  of  u  for  example  4  at  a1=a2=1 


Figure  5.10:  Optimal  values  of  xi,  X2  and  u  for  Example  4  with  aq  =  aq  =  1 


problems  with  multiple  states  and  controls. 
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Optimal  value  of  x  1  For  Example  4  at  q1=0.7  and  a2=0.8 


Optimal  value  of  u  for  Example  4  at  a1=0.7  and  «2=0.8 


Figure  5.11:  Optimal  values  of  X\  and  X2  for  Example  4  with  an  =  0.7  and  =  0.8 
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Abstract.  In  this  paper  we  propose  an  efficient  and  easy-to-implement  nu¬ 
merical  method  for  an  a-th  order  Ordinary  Differential  Equation  (ODE)  when 
a  E  (0, 1),  based  on  a  one-point  quadrature  rule.  The  quadrature  point  in  each 
sub-interval  of  a  given  partition  with  mesh  size  h  is  chosen  judiciously  so  that 
the  degree  of  accuracy  of  the  quadrature  rule  is  2  in  the  presence  of  the  singu¬ 
lar  integral  kernel.  The  resulting  time-stepping  method  can  be  regarded  as  the 
counterpart  for  fractional  ODEs  of  the  well-known  mid-point  method  for  1st- 
order  ODEs.  We  show  that  the  global  error  in  a  numerical  solution  generated 
by  this  method  is  of  the  order  G(h 2),  independently  of  a.  Numerical  results 
are  presented  to  demonstrate  that  the  computed  rates  of  convergence  match 
the  theoretical  one  very  well  and  that  our  method  is  much  more  accurate  than 
a  well-known  one-step  method  when  a  is  small. 


1.  Introduction.  Modelling  and  optimal  control  of  many  practical  systems  in  engi¬ 
neering,  science  and  economics  traditionally  involve  Ordinary  Differential  Equation 
(ODE)  systems  of  integer  orders  [2,  24,  25,  27,  28,  29,  30].  While  integer  order  ODE 
systems  are  adequate  for  capturing  the  evolution  of  most  standard  phenomena,  it 
has  been  shown  over  the  last  two  decades  that  many  complex  systems  in  solid  me¬ 
chanics,  viscoelastics,  gas  diffusion  and  heat  conduction  in  porous  media,  signal 
and  image  processing,  bio-engineering,  biology,  economics  and  financial  engineer¬ 
ing  are  better  modelled  by  systems  with  fractional  or  non-integer-order  differential 
equations  (cf.,  for  example,  [3,  4,  5,  6,  7,  8,  22,  23,  26]).  In  particular,  complex 
phenomena  involving  memory  effects  can  be  modelled  more  appropriately  and  ac¬ 
curately  by  fractional  dynamical  systems  than  by  classical  (integer)  ones.  As  it 
is  very  rare  that  a  system  of  practical  significance  can  be  solved  analytically,  one 
needs  to  be  able  to  solve  the  system  numerically.  Clearly,  an  accurate  and  compu¬ 
tationally  efficient  numerical  scheme  is  crucial  for  solving  fractional  ODEs.  This  is 
particularly  true  when  solving  an  optimal  control  problem  involving  such  a  system 
as  an  iterative  computational  procedure  for  computing  the  optimal  control  requires 
the  solving  the  system  repeatedly. 
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We  consider  the  following  fractional  initial  value  problem: 


o  D?x(t)  =  f(t,x(t)),  f£(0,T],  (1) 

z(0)=a:o,  (2) 


where  xq  and  T  are  positive  constants.  /  is  a  known  function  and  o Dfx{t)  denotes 
the  following  Captuto’s  ath-order  derivative  of  x(t)  in  (0,T]  with  a  £  (0, 1): 


o  D?x(t)  = 


1 


f 


c'(r) 


(t-T-y 


dr. 


r(i-a) 

In  the  above  T(-)  denotes  the  Gamma  function.  Higher-order  fractional  initial  value 
problems  can  be  transformed  into  a  system  of  fractional  initial  value  problems  of 
the  form  (l)-(2)  and  any  efficient  numerical  method  developed  for  (l)-(2)  can  be 
extended  to  a  vector-valued  initial  value  problem.  There  is  also  another  repre¬ 
sentation  of  the  ath-derivative  called  the  Riemann-Liouvillc  fractional  derivative. 
However,  initial  value  problems  involving  the  Riemann-Liouville  fractional  deriv¬ 
ative  can  be  readily  transformed  into  (l)-(2)  as  demonstrated  in  [10,  14].  It  has 
been  proven  [10,  14,  16]  that  solving  (l)-(2)  is  equivalent  to  solving  the  following 
Volterra  integral  equation: 


x(t)  =  x0  +  -J^[  (t  —  r)a  1  f  (r ,  x{r))  dr ,  a  £(0,1).  (3) 

r(o0  Jo 

In  the  open  literature,  there  are  four  main  numerical  methods  for  solving  (3):  Eu¬ 
ler’s  method  [20],  an  Adams  type  predictor-corrector  method  [11,  12,  13,  19,  20],  the 
p- th  order  method  [20,  21]  and  the  block  by  block  method  [1,  15,  17].  Euler’s  method 
is  simple,  but  the  convergence  order  is  only  0(ha ),  where  h  denotes  the  mesh  size  of 
a  uniform  partition  of  (0,  T ).  The  Adams  type  predictor-corrector  method  was  first 
proposed  by  Diethelem  et  al.  [11].  They  showed  that  the  convergence  order  of  this 
method  is  0(h1+a).  Based  on  the  work  of  Diethelem  et  al.,  Deng  and  Li  [9]  have 
developed  another  improved  predictor-corrector  method.  They  proved  that  the  or¬ 
der  of  convergence  of  the  improved  version  is  0(/imm(1+2“’2)).  Both  of  the  schemes 
in  [9]  and  [11]  are  single  step  methods.  The  p- th  order  and  block-by-block  methods 
have  convergence  rates  of  orders  0(h3)  and  0(h3+a),  respectively.  However,  these 
methods  are  linear  multiple  step  methods  and  thus  computationally  much  more 
expensive  than  single  step  methods.  Therefore,  a  question  that  arises  is  whether 
it  is  possible  to  design  a  single  step  method  for  (3)  with  an  upper  error  bound 
better  than  0(hmin^1+2a,2'>)  when  a  <  0.5.  In  the  integer  case  that  a  =  1,  the 
mid-point  one  step  method  has  an  upper  error  bound  of  order  0(h2).  This  method 
takes  advantage  of  the  property  that  the  mid-point  quadrature  rule  yields  a  ‘super¬ 
convergence’  point  for  numerical  integration,  i.e. ,  the  mid-point  is  the  only  one  in 
an  interval  which  gives  the  exact  numerical  integral  when  the  integrand  is  a  linear 
polynomial.  Clearly,  this  super-convergence  property  of  the  mid-point  quadrature 
rule  does  not  hold  true  for  integrals  of  the  type  (3),  because  the  kernel  becomes 
singular  when  r  approaches  t.  Thus,  the  question  of  what  is  the  counterpart  of  the 
classical  implicit  mid-point  numerical  integration  method  for  (3)  arises. 

In  this  work,  we  design  a  one-step  numerical  method  for  (3)  which  is  easy  to  im¬ 
plement,  computationally  inexpensive,  and  results  in  a  global  error  of  order  0(h2) 
for  any  a  £  (0,1).  This  method  can  be  regarded  as  the  counterpart  for  frac¬ 
tional  ODEs  of  the  implicit  mid-point  numerical  integration  method  for  first-order 
ODEs.  In  this  method,  we  choose  a  numerical  quadrature  point  in  each  of  the  sub- 
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intervals  of  a  given  partition  judiciously  so  that  the  local  approximation  error  is  of  a 
higher  order  than  that  from  the  conventional  one-point  quadrature  rule.  This  is  the 
counterpart  of  the  mid-point  quadrature  rule  in  conventional  numerical  integration. 
Based  on  this  special  numerical  quadrature  rule,  we  develop  a  one-step  numerical 
integration  method  for  (3)  and  prove  that  the  global  error  in  the  numerical  solutions 
generated  by  this  method  is  of  order  0(h 2). 

The  rest  of  the  paper  is  organized  as  the  follows.  In  Section  2,  we  propose  an 
approximation  of  (3)  based  on  a  Taylor  expansion.  An  error  analysis  of  the  approx¬ 
imation  is  also  presented.  In  Section  3,  we  propose  an  algorithm  for  implementing 
the  approximate  equation  and  analyse  its  convergence.  In  Section  4  we  apply  our 
method  to  several  factional  ODEs  with  known  exact  solutions  to  confirm  the  theo¬ 
retical  error  estimate  and  demonstrate  the  superiority  of  our  method  over  some  of 
the  existing  ones.  Section  5  concludes  the  paper. 


2.  Approximation.  For  a  given  positive  integer  N,  we  first  divide  (0,T]  into  N 
sub-intervals  with  mesh  points  t.t  =  ih  for  i  =  0,1 , . . . ,  A,  where  h  =  T/N.  Thus, 
(3)  can  be  written  as  follows. 

x(ti)  =  x0+-*—[  (ti  -T)a~lf{T,x(T))dT 

r(a)  J o 

=  zo  +  [  (ih  -  t)“_1/(T,  x(t))  dr 

r(o0  Jo 

=  ^0  +  7T7-T  /  (*/l  —  t)“_1/(t,  x(r))dT.  (4) 

We  now  consider  approximation  of  the  integral  on  right  hand  side  of  (4).  Assume 
that  f(t,x(t))  is  twice  continuously  differentiable  with  respect  to  both  t  and  x.  For 
j  =  1,  2, . . . ,  i,  we  apply  Taylor’s  theorem  to  /(t,  x(t))  at  to  get 

/(r,  x(t))  =  f(Tij,x(Tij))  +  Kij(T  -  Tij)  +  Cij  (r  -  Tij )2,  (5) 

where  cl3  a  constant  representing  the  2nd  derivative  of  /  at  a  point  between  r,?-  and 
r  and 


A  j  j  ,/V  (  X'jj  ,  X  (  Tjj  )  )  T  fx  {Xij  ,  x(Tij  ')')xr  {t^j  ) . 

Therefore,  replacing  /(r,  x(t))  in  the  integrand  of  the  last  term  in  (4)  with  the  RHS 
of  (5),  we  have,  for  any  j  =  1, ...,  i, 


[  (ih  —  t)“  1f(T,x(r))dT 

7-T  /  (ih  -  x)a~^[f  (t^j ,  X (tij ) )  +  Kij(T  -  Tij)]dT  +  Ri 

(a)  JU-i)h 

(ih~(j~l)h)a  (ih~jh)c 


T(a) 


T(c 


T(a) 


f(nj,x(Tij)) 


a 


a 


+  W2\  f  (ih-T)a  l(T-Tij)dT  +  Ri 

J-  J (j  —  l)h 


T(a  +  1)  ‘ 


f(Tij,x(Tij))l(i  -  j  +  l)a  -  (i  -  j)a] 
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+  /  ( ih-r)a  \T^TlJ)dT  +  RlJ,  (6) 

l  (a) 

1  fjh 

where  %  =  — —  /  (ih  -  r)a_1c!;i(r  -  Ty)2dr. 

^  l^V  J ( j  —  l)h 

We  now  consider  the  choice  of  r^-.  From  (6)  it  is  clear  that  rtj  should  be  chosen 
such  that  the  second  term  becomes  zero  so  that  the  truncation  error  in  (6)  is  just 
Rtj.  The  choice  of  t,,  is  given  in  the  following  theorem. 


Theorem  2.1.  For  any  given  i  =  1,  2, . . . ,  N  and  j  =  1,  2  . . . ,  i,  the  unique  solution 
to  j 

[  (ih  —  t)“-1(t  —  Tij)dr  =  0  (7) 

is  given  by 

[(*  -  3  +  l)a+1  -  (*  -  J')“+1]  +  (a  +  !)[(»  -  3  +  1  )a(j  -!)-(*-  j)a3] 


=  hl- 


(a  +  l)[(i  -  j  +  l)a  -  (i  -  j)a] 

Furthermore,  ( j  —  1  )h  <  Ttj  <  jh. 

Proof.  We  first  integrate  the  LHS  of  (7)  using  integration  by  parts  as  follows. 


(8) 


rih 


( ih  —  r)a  1(t  —  Tij)dr  =  — 


(ih  —  t)c 


(T  Tij ) 


jh 


(j-l)h 


(ih  —  t)q+1 


x(a  +  1) 


jh 


(. 3-1)/* 


[(*  -  j)aj  +  1  )a(j  -  1  )}ha+1 


a 


+ 

=0. 


l(i-j  +  l)a-(i-j)a]ha 

- T 

a 

(i  ~  j  +  l)a+1  ~  (i-j)a+1j 

a(a  +  1) 


,ct+ 1 


From  the  above,  we  have 


[(i 


—  3  + 1)“  —  (i  —  j)a}nj 
(i  -  j  +  l)a+1  -  (i  -  j)a+1 

a  +  1 


h+  [(i 


j  +  l)a(j  -1)-  (i-j)aj]h. 


Solving  this  for  t we  get  (8). 

We  now  show  that  (j  —  1  )h  <  t.^  <  jh.  From  (8), 


[(i  -j  +  l)a+1  -  (i  -  j)a+1]  +  (a  +  l)[(i  -  j  +  1  )a(j  -  1)  -  (i  -  j)aj } 
(a  +  1)[(*  -  j  +  l)a  -  (i-j)a] 

[(»  -  3  +  l)a+1  ~  (i  -  J')a+1]  +  (<*  +  l)[(i  -  3  +  1  )a(j  -!)-(»-  3)aj] 
(a  +  l)[(i-j  +  1)«  -  (i  -  j)a] 

_  (a  +  l)j[(»  -  j  +  l)a  -  (i  -  j)a] 

(a  +  1)  [(*  -  j  +  l)a  -  (i  -  j)a] 
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(i  —  j  +  1)Q+1  —  {i  —  j)a+1  —  (a  +  l)(i  —  j  +  1)“ 

(a  +  l)[(i  -  j  +  1)Q  -  (i  -  j)a] 

To  prove  —  jh  <  0,  we  only  need  to  show  that 

(i-  j  +  1)“+1  -  (i  -  j)a+1  -  (a  +  l)(i  -  j  +  l)a  <  0. 

Using  the  mean  value  theorem,  we  have 

{i- j  +  l)a+1  -  (i- j)a+1  -  +  j  +  l)a  =  (a  + 1)£“  —  (a  + 1) (i  —  j  +  1)Q  <  0, 

since  £  £  (i  —  j,  i  —  j  +  1).  Thus,  we  conclude  that  —  jh  <  0.  In  a  similar  way, 
we  can  prove  that  Tij  —  (j  —  l)h  >  0.  D 

Combining  Theorem  2.1  and  (4),  we  have  the  following  representation  for  x{ti). 


x(U)  =  x0  + 


T(a  +  1) 


Y  f(Tij,x(Tij))[{i  ~j  +  1)“  -  (i  -  j)a]  +  Ri 


(9) 


i= i 


for  *  =  1,2,...,  AT,  where  t. ij  is  given  in  (8)  for  j  =  1,2, ...  ,i  and  Ri  =  Yl)=i  Rij- 
Omitting  the  remainder  Ri  in  (9),  we  have  an  equation  approximating  (4)  which  has 
the  truncation  error  Ri.  An  upper  bound  for  Ri  is  given  in  the  following  theorem. 

Theorem  2.2.  Suppose  that  f{t,x{t))  is  twice  continuously  differentiable  in  t  and 
x.  Then,  for  any  i  =  1,2, ..,  N,  the  following  estimate  holds: 

\Ri\<Ch2,  (10) 

where  C  denotes  a  positive  constant  independent  of  h  and  a. 

Proof.  For  j  =  1,2,...,*,  from  the  definition  of  Ri  and  Theorem  2.1  we  have 


I  Rij  I  — 


< 


Cij 

r(«L 

rjh 

/  (■ ih-r)a~ 

'O-i  )h 

\cij\h2 

rjh 

/  (ih-r)a 

T(a) 

\cij\  h2 

([(i-j  +  l)h]a 

T(a) 

\  a 

Since  /  is  twice  continuously  differentiable,  Cij  is  bounded  on  [0,  T],  Let  c  = 
maxi(maxi<j<i  |c,j|).  Then,  from  the  definition  of  Ri  and  the  above  estimate, 
we  have 

i 

1^1  <  El^il 

l=i 

<  ch2  ^  \  [{i  -  j  +  l)h]a  [( i-j)h] 

~  r(a)  \  a  a 

ch 2  (■ ih)a 
T(a)  a 
ch2  Ta 

~  r(a)  a 

=  Ci/)2, 
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where  C\  =  =  T^i+a^Ta.  Since  a  £  (0,1),  from  [18]  we  have  2“  1  < 

T(1  +  a)  <  1.  Also,  it  is  obvious  that  Ta  <  max{l,T}  for  a  £  (0,1).  Therefore, 
from  the  above  estimates  we  see  that  (10)  holds  true  for  a  C  independent  of  both 
h  and  a.  Thus,  we  have  proved  the  theorem.  □ 

From  (9)  it  is  clear  that  to  compute  x(U),  we  need  to  calculate  f(rij,x(Tij)),j  = 
1,2,...,?'.  However,  x(rij),j  =  1 ,...,?,  are  not  available  directly  from  the  scheme, 
although  the  r.t?  are  known.  Thus,  approximations  for  x{jij),j  =  1  ,...,*,  need  to 
be  determined.  In  the  next  section,  we  propose  a  single  step  numerical  scheme  for 
implementing  (9)  when  the  remainder  Ri  is  omitted. 

3.  Algorithm  and  convergence.  In  this  section,  we  propose  an  explicit  time¬ 
stepping  algorithm  for  approximating  the  solution  of  (9)  when  Ri  is  omitted,  based 
on  the  linearization  of  the  nonlinear  term  in  x(U)  in  (9). 

For  any  indices  i  and  j  satisfying  1  <  j  <  i  <  TV,  since  by  Theorem 

2.1,  we  use  the  following  linear  interpolation  to  approximate  a;(r,j): 

x{Tij)  =  x(tj- 1)  +  pij(x(tj)  -  x{tj- 1))  +  0{h2),  (11) 

where 

P,,  -  T'J  1  €(0,1).  (12) 

Then,  we  approximate  /(r^,  x(jij))  as  follows. 

f\- =  f  (Tl3,x{tj- 1)  +  Pij(x(t3)  -  x(tj-i)))  +  0(h2)  (13) 

for  1  <  j  <  i  <  N.  Clearly,  if  we  replace  f(Tij,x(rij))  in  (9)  with  the  above  expres¬ 
sion  and  omit  the  truncation  error  terms  of  order  0(h 2),  we  define  the  following 
single  step  time-stepping  scheme  for  (3): 

i 

Xi  ==  x0  T  ha  ^  ]  f  {r%ji  Xj— i  +  pij[xj  Xj~ i))  [(?  j  T  1)  (i  j )  ] 
t=i 

=  +  hagi(x o,®i,  ••.,  Xi_i)  +  haf(TU,Xi-i  +  pu(xi  -  x^))  (14) 

for  i  =  1,  2, . . . ,  N,  where  r?J-  is  defined  by  (8)  and 

ha  =  ha/T(l  +  a),  (15) 

Si (X0i  Xi,...,  Xi—  i ) 

K  <=1* 

I  (Tij’Xj~1+  Pij(Xj  -  x3- 1))  [(*  -3  +  !)“  -  (*  -  j]a],  i  >  1- 

U=i 

(16) 

The  above  scheme  is  implicit  as  the  last  term  on  the  RHS  of  (14)  contains  a 
nonlinear  function  of  ay.  We  can  use  an  iterative  method  such  as  the  conventional 
Newton’s  method  to  solve  (14)  which  is  usually  computationally  more  expensive 
than  the  predictor-corrector  process.  However,  for  this  case,  we  may  define  an 
explicit  single  step  scheme  by  further  approximating  the  last  term  in  (14)  by  the 
following  truncated  Taylor’s  expansion: 

f  (t ii ,  Xi  —  1  T  pa  ( Xi  Xi—  1 ))  —  f  (th  ,  Xi—  1 )  T  fx  (l ~ii  i  Xi—  \  )  pa  [Xi  Xi—  1 )  4 ~&(h  ).  (17) 

Combining  (14)  and  (17),  we  propose  the  following  explicit  one-step  algorithm  for 
(3). 
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Algorithm  A: 

1.  For  a  given  positive  integer  N,  let  it  =  ih  for  i  =  0, 1, N,  where  h  =  T/N. 

2.  For  i  =  1,2,...,  JV,  compute 

X0  ha(}i  (X()i  X]  .  . . . ,  Xi —  1 )  ha  (/'  (t-h  ,  Xi—  i )  piifx  {Xii  )  1  )*U— l)  /icm 

Xi  =  Z  7  77  7"  7  ) 

J-  P/cxPiiJx\J~iii  ^i—1) 

where  t^,  pi:),  ha  and  gi(xo, ...,  a;,_i)  are  defined  in  (8),  (12),  (15)  and  (16), 
respectively. 

As  can  be  seen  later,  Algorithm  A  provides  an  efficient  and  stable  2nd-order 
method  for  (3).  Strictly  speaking,  Algorithm  A  is  a  multiple  step  method.  This  is 
because  the  fractional  derivative  is  a  global  operator  and  thus  all  Xj  ,j<i  are  needed 
in  order  to  evaluate  Xi  in  Algorithm  A.  However,  since  the  last  two  terms  in  the 
numerator  on  the  RHS  of  (18)  only  involve  Xj_i,  we  still  call  it  a  one-step  method. 
Also,  unlike  the  case  of  the  explicit  mid-point  method  for  first-order  ODEs,  it  is 
generally  very  hard  to  construct  a  one-step  explicit  or  predictor-correct  method 
based  on  (9).  This  is  because  construction  of  such  a  scheme  usually  requires  a 
fractional  Taylor  expansion  which  typically  has  a  truncation  error  of  order  0(hna ) 
for  some  positive  integer  n.  Clearly,  na  — >  0  as  a  approaches  0. 

Using  linear  interpolation  theory  and  Taylor’s  theorem,  we  are  able  to  prove  that, 
for  any  i  =  1, 2, . . . ,  N,  Xi  generated  by  Algorithm  A  converges  to  x(ti)  at  the  rate 
0(h2)  when  h  — >  0+.  This  is  formalized  in  the  following  theorem. 

Theorem  3.1.  Let  x(t)  be  the  exact  solution  of  (3)/(f).  If  f(t,x)  is  twice  con¬ 
tinuously  differentiable  in  t  and  x,  then  there  exists  an  h  >  0  such  that  ha  <  h 
implies 

\x{ti)  -  Xi\<Ch2,  i  =  l,2,...,N,  (19) 

where  {xi}  is  the  sequence  generated  by  Algorithm  A,  C  is  a  positive  constant, 
independent  of  h  and  a,  and  ha  is  defined  in  (15). 

Proof.  In  what  follows,  we  let  C  denote  a  generic  positive  constant,  independent  of 
h.  We  now  prove  this  theorem  by  mathematical  induction.  Firstly,  we  show  that 
(19)  holds  for  i  =  1. 

By  (9),  we  have 

x{h)  =  x0  +  hQ/(ru,a:(rii))  +  . 

Furthermore,  from  (11)  and  (13),  we  have 
x(ti)  =  x0  +  haf(Tu,x0  +  pii(x(t\)  -  x0)  +  0(h2)  +  Ri 

=  +  ha{f(rn,x  o)  +  fx(xii,  Xo)[pn(x(ti)  —  Xo )  +  0(h2)]  +  0(h~)}  +  Ri 

=  x0  +  ha  [f  (th,  x0)  -  fx(T11,x0)pux0}  +  hafx(Tu, x0)pnx(ti)  +  0(h2) 

+  R\, 


where  pu  is  defined  in  (12).  Solving  this  equation  for  x(ti)  and  using  (18)  with 
i  =  lwe  have 

)  _  X0  +  hg  [/(tii,2;o)  -  fx(Tii,x0)p11x0\  +  Q(h2)  +  Ri 

1  hafx  (th  ,  Xo^pn 

x  |  Q(h2)  +  iU 

1  —  hafx(jn,Xo)pil 


DISTRIBUTION  A.  Approved  for  public  release:  distribution  unlimited. 


280 


WEN  LI,  SONG  WANG  AND  VOLKER  REHBOCK 


Therefore,  using  (10),  the  previous  equation  yields  the  inequality 

Ch 2 


\x(h)  -  Xi\  < 


1  -  Kfx{r11,x0)p11\ 


(20) 


Note  pn  £  (0,1)  by  (12)  and  ha  >  0.  So,  if  fx(r \i,xq)  <  0,  from  (20)  we  see 
that  (19)  is  satisfied  for  i  =  1.  However,  when  fx(ru,  Xq)  >  0,  we  need  to  choose 
an  upper  bound  for  ha  so  that  the  denominator  of  (20)  is  bounded  below  by  a 
positive  constant.  Clearly,  for  a  given  constant  er  £  (0, 1),  if  we  choose  hi  := 

- 7 - then  (19)  is  satisfied  for  i  =  1,  when  ha  <  hi.  To  be  more 

general,  we  choose 


h  := 


1  —  a 


max{maxi<i<JV  pufx{ru,  xt- 1),  1}  ’ 


and  when  ha  <  h,  (19)  is  satisfied  for  i  =  1. 

We  now  consider  the  case  of  i  >  2.  Assume  that 

max  I x(tj)  —  xA  <  Ch2 

i<j<i-i  J  J 


(21) 


(22) 


when  ha  <  h.  We  now  show  that  max  \x (tj)  —  Xj  \  <  Ch2,  or  equivalently,  \x(ti)  — 

1  <j<i 

Xi\  <  Ch2. 

Using  (13)  and  (17),  we  have,  from  (9)  and  (10) 
x(U) 

i 

=xo  +  K^2{  [f{Tij,x(tj- 1)  +  Pij(x(tj)  -  x(tj- 1))  +  Q(h2))] 

1= 1 

X  [(*  -3  +  !)“  -  (i  -  j)“]}  +  Ri 
=x0 

i- 1 

+  ^^/(nj,i(fi-i)+ftjWtj)  -  x{tj-i))  +  <D(h2))[(i  -  j  +  l)a  -  (*  -  j)“] 
i=i 

+  ha  {f(Tn,x(ti-i))  +  fx(Tii,x(ti-i))pu(x(ti)  -  ir(U_i))]  +  0(h2)  +  Ri 
=xq  +  Ai—  i  +  Bi  +  0(h2).  (23) 


where 


2—1 

-Aj —  l  — ha}  ^ ./'  ( Xjj .  x  ( tj  _  j )  T  Pij  (x(tj)  x(tj~  i)) 
l=i 

+o(/i2))[(i-i+ir-(*-in 

^2  — ha  X (ti—  1 ) )  T  fxixii,  x(tj—\  ))p,;j(x(tj)  ^(U— l))]- 


(24) 

(25) 


Note  that  (18)  can  be  re-written  in  the  following  form. 


2-1 

Xo  +  ha^fiTijtXj-i  +  pij(xj  -  Xj-i ))[(i-j  +  1)“  -  (i  -  j)a] 
l=i 

~hha[f  (th,  X%— i)  4-  f X(xn,  Xj,—  i) pa(Xi  £2— l)] 

Xq  +  +  Bi ,  (26) 
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Ai-i  =  ha^jf(Tij,xj-1+ pij(xj  -  Xj-^Ki- j +  l)a  -  (i- j)a],  (27) 

3= 1 

Bi  —  ha  [f(ra ,  Xi—  i )  T  fx {th ,  Xi— i )pa  (xi  Xi—  i)]  ■  (28) 

Subtracting  (26)  from  (23)  gives 

x{ti )  —  Xi  =  —  Ai_i)  +  (Bi  —  Bi)  +  0(h2).  (29) 

Let  us  first  estimate  Bi  —  Bi.  From  (25)  and  (28),  we  have 

Bi  B^  [ha  [f  (tu  ,  x(ti—  1 ))  d"  fx  (p~a  ?  x(ti  —  i  ))pa  (x(fi)  x(fi—  1 ))] 

-  ^[/(Ui^i-l)  +  fx{Tii,Xi-i)pu(Xi  -  £j_i)]] 

=ha[f(Tu,  x(ti-i))  -  /(Tii^i-i)] 

“b  hapii \fx  (p~ii  i  x(ti— ±)^x(ti)  fx{TiiiXi—l)Xj\ 

hfxPii  [fx  iAiii  x(ti — fffjx{t-i— i)  /a:  ("Ti*  ?  1  )*U  —  l]  ■  (dO) 

Note  that  /  is  twice  continuously  differentiable.  Using  a  Taylor  expansion,  we  get 
fx{n%,x{ti- 1))  =  fx(Tii,Xi-i)  +  Vi,  (31) 

where 

Xj  —  f xx{j~iii  £f)(x(ti—  l)  l) 

with  £  being  a  point  between  x(ti-±)  and  Xi-±.  Since  \x{ti-\)  —  Xi-±\  <  Ch 2  by 
Assumption  (22),  we  have 

rt  =  fxx(TU,0-O(h2)  =  O(h2). 


Similarly,  we  have 

f{Ta,x{ti- 1))  -  f(Tu,Xi- 1)  =  0{h2). 

Using  (31)  and  the  above  estimate  we  have,  from  (30), 

Bi  Bi  — ha(D(h  )  T  hapn\f xiCiii  Xi—\)(x(ti)  Xi)  T  x(ti)Vi\ 

haPii  [fx  (X'ii ,  Xi—  \  )  (x(U_  i ))  Xi— \  )  “b  X (ti—  \  )r.j] 

— h(xpiifx{xiiiXi—\f[x{ti)  Xj\  T  hapiifx(XiiiXi—i)[Xi—\  x[ti— i)] 

+  0(h2+a) 

=hapiifx(xu,  Xi-\)[x(ti)  -  xf\  +  0(h2+a), 

since  ha  =  ha /T(l  +  a)  and  \xi-i  —  x(ti- 1)|  <  Ch2  from  Assumption  (22).  Thus, 
from  the  above  expression  and  (29),  we  get 

x(t'i )  Xi  ( Aj_ ]  Ai_i )  T  haPiifx(^TiijXi—-f)[x(ti)  (^i)]  ~b  0(h  ). 


This  implies 


and  so 


x(ti)  -  Xi 


(Ai-i  —  Ai- 1)  +  0(h2) 
1  haPiif x(THt  Xi—l) 


I x{ti)  -  Xi  I  < 


|Ai_1-A_1|  |  Q{h 2) 

1 1  haPiifx  (P~ii  ?  1 )  |  1 1  haPiifx  {j~ii  i  Xi—  1 )  | 


(32) 


To  estimate  | x{tf)  —  Xi\,  we  need  to  estimate  |A;_i  —  Ai- 1|.  To  simplify  our 
notation,  we  let  Xij  =  Xj-i  +  Pij(xj  —  Xj- 1).  We  also  use  either  the  RHS  or  LHS 
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of  (11)  to  represent  the  point  x(r^).  From  the  definitions  of  Ai_i  and  Ai_i  in  (24) 
and  (27),  respectively,  and  using  (11),  we  have 


I  A.  j, — i  — hQ 


2-1 


J^lf(Tij>x(Tij))  ~  f{Tij,Xij)][(i-  j  +  1)“  -  (i-j)a] 


3=1 

2-1 


<ha  Y  I  [f(Tij,x(Tij))  -  f{Tij,  Xij)]  [( i  -  j  +  1)“  -  (i  -  j)a]  I 
3=  1 
2-1 

=KY  ~  [( i-j  +  1)“  -  (*  -j)a],  (33) 

i=i 

since  za  is  an  increasing  function  of  2  for  a  £  (0, 1).  Because  /  is  twice  continuously 
differentiable,  we  have  (recall  that  C  is  a  generic  positive  constant,  independent  of 
h) 

\f(Tij,  X  (Tij))  -  f(TijtXij)\  <C\x(t13)  -  Xij  | 


=C 


[x(tj- 1)  +  Pij{x{tj)  -  x{tj- 1))  +  0{h2)\ 


-  [Xj_  1  +  Pij{Xj  -  Xj- 1) 


=C 


[x(^-i)  -  Xj_i]  +  Pij[x(2j)  -  Xj)] 


+  Pij(xj_i -x(2j_i)]  +  (D(h2) 

<C  (\x(tj- 1)  -  Xj_i|  +  |x(2j)  -  Xj)|  +  \x(tj- 1)  -  Xj_i|) 

+  0^), 

since  p,;j  £  (0, 1).  In  the  above  we  have  used  (11).  Thus,  from  Assumption  (22),  we 
have 

I f(Tij,x(rij))  -  f(nj,xri.) |  <  Ch2. 

Replacing  |/(r^-, x(r^))  —  /(r^-, xr. .)|  in  (33)  with  the  above  upper  bound,  we  have 

2-1 

\Ai-!-  Ai-X\  <  haCh2Y\^~j  +  l)a-ii-j)a] 

3=1 

ha 

= _ n _ Ch2^a 

T{a  +  1) 

h° 


< 


T(a  +  1) 


C 


T(a  +  1) 

c 

~  T(a  +  1) 
<  Ch2. 


Ch2(ia  -  1) 
Ch2Na 
h2(hN)a 
h2Ta 


Combining  the  above  error  bound  with  (32),  we  have 

,  .  ,  Ch2 

| X{ti)  -  Xi  |  < 


1  haPiifx(Tii^Xi—x)\ 
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Therefore,  when  ha  <  h1  where  h  is  defined  in  (21),  we  have 


I  x(ti) 


Xi\  < 


_ Ch2 _ 

|1  haPiif x{tHt  Xi— 1)| 


Ch 2 

a 


for  a  given  cr  >  0. 

Careful  readers  may  have  noticed  the  positive  constant  C  used  in  the  above 
proof  is  independent  of  h ,  but  a  function  of  T“/T(  1  +  a).  However,  in  the  proof 
of  Theorem  2.2  we  have  shown  that  Ta/T(l  +  a)  is  bounded  above  by  a  positive 
constant,  independent  of  a.  Thus,  we  have  proved  the  theorem.  □ 


We  comment  that  in  the  above  theorem  we  have  established  an  upper  error 
bound  of  order  0{h 2)  for  the  numerical  approximation  to  (l)-(2)  generated  by 
our  proposed  single-step  Algorithm  A,  while  all  of  the  existing  single-step  methods 
proposed  in  [9,  11,  12,  13,  19,  20]  have  the  drawback  that  their  rates  of  convergence 
approach  O(h)  as  a  decreases. 

We  also  comment  that  Algorithm  A  is  a  linearized  form  of  our  implicit  method 
(14).  This  linearization  does  not  affect  the  C>(ft2)-order  rate  of  convergence  of  (14). 
In  other  words,  our  explicit  method  represented  in  Algorithm  A  performs  only  one 
Newton  iteration  for  (14)  as  performing  more  Newton  iterations  will  not  increase 
the  accuracy  of  the  numerical  method  due  to  the  discretization  errors. 


4.  Numerical  Results.  In  this  section,  we  solve  two  examples  using  our  proposed 
method. 

Example  1.  Consider  the  following  fractional  differential  equation 

0D?x(t)  =  ^5)  '  tA~a  -  x(t)  +t4,  t  6(0,1], 

1  (5  —  a) 

z(0)  =  0. 

The  exact  solution  is 

x(t)  =  f4. 

This  test  problem  is  taken  from  [9]  and  it  is  solved  by  Algorithm  A  for  various 
values  of  a  and  mesh  sizes  h.  The  computed  errors  Ehk  =  max1<i<1/^fc  \xi  —  a;(tj)| 
for  hk  =  1/ (2fc  x  10),  k  =  0, 1, ...,  6  and  the  chosen  values  of  a  are  listed  in  Table  1. 
To  estimate  the  rates  of  convergence,  we  calculate  log 2{Ehk/  Ehk+1)  for  k  =  0,1, ...,  5 
and  the  computed  rates  of  convergence  are  also  listed  in  Table  1.  From  the  table 
we  see  that  the  computed  rates  of  convergence  are  very  close  to  the  theoretical  one 
in  Theorem  3.1.  In  Table  1,  we  also  compare  our  results  with  those  obtained  by  the 
method  in  [9].  The  latter  method  is  a  single-step  predictor-corrector  method  with 
a  theoretical  rate  of  convergence  of  order  0(1 lmm(1+2“.2)).  The  rates  of  convergence 
of  our  method  are  much  higher  than  those  of  the  method  in  [9]  for  a  =  0.1.  From 
the  table  we  also  see  that  the  absolute  errors  from  our  method  are  much  smaller 
than  those  in  [9]  unless  a  is  close  to  1  in  which  case  classic  methods  apply.  Clearly, 
our  proposed  method  is  superior  to  that  in  [9].  In  addition,  one  can  expect  that 
a  predictor-corrector  method  should  be  computationally  more  expensive  than  our 
method. 

From  Table  1  we  see  that  when  a  is  close  to  zero,  the  computed  rate  of  con¬ 
vergence  is  slight  worse  than  0(h2).  This  may  be  because  the  constant  C  in  (19) 
contains  the  term  l/r(l  +  a),  as  noted  in  the  previous  section.  However,  when  h 
decreases,  the  rate  of  convergence  of  our  method  increases. 
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Table  1.  Maximum  Errors  and  Convergence  Rates  for  Example  1. 


h 

Our  results 

Results  from  [9] 

Our  results 

Results  from  [9] 

a=0.1 

Order 

a=0.1 

Order 

CO 

O 

II 

s 

Order 

a=0.3 

Order 

1/10 

4.06e-3 

- 

0.364 

- 

7.67e-3 

- 

- 

- 

1/20 

l.lle-3 

1.86 

0.170 

1.10 

2.00e-3 

1.94 

- 

- 

1/40 

3.02e-4 

1.89 

7.13e-2 

1.26 

5.17  e-4 

1.95 

- 

- 

1/80 

8.08e-5 

1.90 

2.88e-2 

1.31 

1.32e-4 

1.97 

- 

- 

1/160 

2.14e-5 

1.92 

1.15e-2 

1.32 

3.37e-5 

1.97 

- 

- 

1/320 

5.65e-6 

1.93 

4.64e-3 

1.31 

8.55e-6 

1.98 

- 

- 

1/640 

1.47e-6 

1.93 

1.88e-3 

1.30 

2.16e-6 

1.98 

- 

- 

1/1280 

3.84e-7 

1.94 

- 

- 

5.46e-7 

1.99 

- 

- 

h 

Our  results 

Results  from  [9] 

Our  results 

Results  from  [9] 

a=0.5 

Order 

a=0.5 

Order 

Q 

II 

o 

c o 

Order 

a=0.9 

Order 

1/10 

8.49e-3 

- 

0.0355 

- 

7.88e-3 

- 

0.0107 

- 

1/20 

2.16e-3 

1.98 

0.00879 

2.01 

1.97e-3 

2.00 

0.00231 

2.21 

1/40 

5.43e-4 

1.99 

2.16e-3 

2.03 

4.93e-4 

2.00 

5.21e-4 

2.15 

1/80 

1.37e-4 

1.99 

5.31e-4 

2.02 

1.23e-4 

2.00 

1.22e-4 

2.09 

1/160 

3.43e-5 

2.00 

1.31e-4 

2.02 

3.08e-5 

2.00 

2.94e-5 

2.06 

1/320 

8.58e-6 

2.00 

3.24e-5 

2.02 

7.70e-6 

2.00 

7.18e-6 

2.03 

1/640 

2.15e-6 

2.00 

8.03e-6 

2.01 

1.92e-6 

2.00 

1.77e-6 

2.01 

1/1280 

5.38e-7 

2.00 

- 

- 

4.81e-7 

2.00 

- 

- 

Example  2.  Consider  the  following  fractional  differential  equation 

0D?x(t)  =  ^  t3  +  t3+a  -  x(t),  t  €  (0, 1], 

6 

z(0)  =  0. 

The  exact  solution  is  x(t)  =  t3+a.  This  example  is  from  [1]  and  it  is  solved  using 
Algorithm  A  for  various  values  of  h  and  a.  The  computed  errors  and  rates  of 
convergence  are  listed  in  Table  2  from  which  we  see  that  the  computed  rates  of 
convergence  are  close  to  the  theoretical  one  in  Theorem  3.1. 

Table  2.  Maximum  Errors  and  Convergence  Rates  for  Example  2. 


h 

a=0.1 

Order 

CO 

O 

II 

$ 

Order 

a=0.5 

Order 

a=0.9 

Order 

1/10 

2.37e-3 

- 

5.08e-3 

- 

6.40e-3 

- 

7.50e-3 

- 

1/20 

6.45e-4 

1.88 

1.31e-3 

1.95 

1.62e-3 

1.98 

1.88e-3 

2.00 

1/40 

1.73e-4 

1.90 

3.39e-4 

1.96 

4.08e-4 

1.99 

4.69e-4 

2.00 

1/80 

4.60e-5 

1.91 

8.65e-5 

1.97 

1.03e-4 

1.99 

1.17e-4 

2.00 

1/160 

1.21e-5 

1.92 

2.20e-5 

1.98 

2.57e-5 

2.00 

2.93e-5 

2.00 

1/320 

3.18e-6 

1.93 

5.58e-6 

1.98 

6.44e-6 

2.00 

7.32e-6 

2.00 

1/640 

8.30e-7 

1.94 

1.41e-6 

1.98 

1.61e-6 

2.00 

1.83e-6 

2.00 

1/1280 

2.15e-7 

1.95 

3.55e-7 

1.99 

4.04e-7 

2.00 

4.57e-7 

2.00 

Example  3.  Consider  the  following  fractional  differential  equation 

o D?x(t)  =  r^4  +  +  ^4(3+a)  _  a-4^  f  €  (0,  1], 

0 
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x(0)  =  0. 

The  exact  solution  is  also  x(t)  =  t3+a.  Note  that  the  RHS  of  the  above  equation  is 
nonlinear  in  both  t  and  x.  This  example  is  solved  using  Algorithm  A  for  various 
values  of  h  and  a  and  the  computed  errors  and  rates  of  convergence  are  listed  in 
Table  3.  From  the  table  we  see  that  the  computed  order  of  convergence  is  greater 
than  2  when  a  is  small. 

Table  3.  Maximum  Errors  and  Convergence  Rates  for  Example  3. 


h 

a=0.1 

Order 

CO 

O 

II 

Q 

Order 

ct=0.5 

Order 

a=0.9 

Order 

1/10 

6.19e-2 

- 

4.27e-2 

- 

2.56e-2 

- 

3.70e-3 

- 

1/20 

1.69e-2 

1.87 

1.03e-2 

2.05 

5.70e-3 

2.17 

6.90e-4 

2.42 

1/40 

4.42e-3 

1.93 

2.368e-3 

2.13 

1.10e-3 

2.37 

1.88e-4 

1.88 

1/80 

1.10e-3 

2.00 

5.055e-4 

2.23 

1.85e-4 

2.57 

5.92e-5 

1.67 

1/160 

2.65e-4 

2.05 

1.025e-4 

2.30 

2.63e-5 

2.82 

2.11e-5 

1.49 

1/320 

6.26e-5 

2.08 

2.00e-5 

2.36 

2.63e-6 

3.32 

6.19e-6 

1.77 

1/640 

1.46e-5 

2.10 

3.77e-6 

2.41 

5.30e-7 

2.31 

1.68e-6 

1.88 

1/1280 

3.41e-6 

2.10 

6.86e-7 

2.46 

1.55e-7 

1.77 

4.37e-7 

1.94 

To  summarise,  from  Tables  1  -3  we  see  that,  although  computed  convergence  rates 
fluctuate  for  different  values  of  a  and  h,  when  ha  is  small  enough,  the  convergence 
order  is  2  confirming  our  theoretical  analysis.  This  can  also  be  seen  from  the  last 
row  of  each  of  the  tables  corresponding  to  h  =  1/1280.  All  the  errors  are  of  the 
magnitude  h2  ss  6  x  10- ' .  To  check  the  robustness  of  our  method  in  a ,  we  have 
also  solved  Example  3  for  a  =  10_l,  i  =  1,...,8  and  the  computed  results  show 
that  the  orders  of  convergence  are  roughly  2,  particularly  when  h  is  small.  The 
robustness  of  our  method  may  provide  an  effective  way  for  solving  problems  with 
algebraic  constraints,  or  differential  algebraic  equations.  We  will  discuss  this  in  a 
future  paper. 

5.  Conclusion.  In  this  paper,  we  proposed  a  new  numerical  method  based  on  Tay¬ 
lor’s  theorem  and  linear  interpolation  for  solving  fractional  differential  equations. 
The  proposed  method  is  simple  and  easy  to  use.  We  have  proved  that  the  con¬ 
vergence  order  of  the  method  is  2.  The  numerical  results  confirm  our  theoretical 
analysis. 
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Abstract.  We  develop  a  finite  difference  method  (FDM)  for  a  2D  frac¬ 
tional  Black-Scholes  equation  arising  in  the  optimal  control  problem  of 
pricing  European  options  on  two  assets  under  two  independent  geometric 
Levy  processes.  We  establish  the  convergence  of  the  method  by  showing 
that  the  FDM  is  consistent,  stable  and  monotone.  We  also  show  that 
the  truncation  error  of  the  FDM  is  of  2nd  order.  Numerical  experiments 
demonstrate  that  the  method  produces  financially  meaningful  results 
when  used  for  solving  practical  problems. 


1  Introduction 

In  this  paper  we  propose  a  2nd-order  numerical  scheme  for  a  2D  fractional  Black- 
Scholes  (fBS)  equation  arising  in  pricing  options  with  two  underlying  assets  [2], 
based  the  schemes  in  [4]  for  a  ID  fBS  equation.  We  prove  that  the  developed 
discretization  method  is  consistent,  stable  and  monotone,  and  thus  the  solu¬ 
tion  generated  by  the  numerical  method  converges  to  the  exact  one.  Numerical 
experiments  have  been  performed  to  demonstrate  the  order  of  convergence  and 
usefulness  of  the  scheme. 

It  is  shown  in  [2]  that  the  value  of  an  option  whose  underlying  asset  price 
follows  a  geometric  Levy  process  is  governed  by  a  ID  fBS  equation.  Under  the 
same  assumptions,  it  is  easy  to  show  that  the  value  U  of  a  two-asset  option  (eg. 
Rainbow  or  Basket  Option)  which  is  written  on  two  stocks  whose  prices  S\  and 
S2  following  two  independent  geometric  Levy  processes  (with  zero  correlation 
coefficient)  is  determined  by  the  following  2D  fBS  equation: 

CU  :=  -Ut  +  01  Ux  +  a2Uy  -  61  [_„ 0D°U\  -  62[_00D^C/]  +  rU  =  0  (la) 

for  ( x,y,t )  €  K2  x  [0,  T),  where  x  =  In  Si,  y  =  lnS2,  _oo D%U  and  -ooD^U 
denote  respectively  the  a-th  and  /3-th  derivatives  of  U  in  x  and  y  for  a,  (3  £ 
(1,2),  T  >  0  is  the  terminal  time,  r  >  0  is  the  risk-free  rate,  a  >  0  is  the 
volatility  of  the  underlying  asset  prices,  and  ai  =  —  r  —  ^<t“  sec  61  =  ai  +  r, 

a2  =  —  r  —  sec  ,  and  b2  =  a2  +  r.  In  computation,  the  domain  R2  has 
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to  be  truncated  into  1?  =  (:rmi„,  zmax)  x  (2/min,ymax)  satisfying  xmin,ymin  <  0 
and  xmax,2/max  >  0.  We  impose  the  following  boundary  and  initial  conditions 

U(x,y,t)  =  U0(x,y,t),{x,y)  £  dO,  U{x,  y,  T)  =  U*{x,  y),  (lb) 


where  Uq  and  U*,  satisfying  Uo(x,y,T)  =  U*(x,y)  for  (x,y)  £  dfl,  are  known 
functions  depending  on  the  types  of  option  and  the  strike  prices  K  of  the 
options.  Using  the  aforementioned  logarithmic  forms,  it  is  easy  to  show  that 
linia^-oo  Ux  =  0  and  lim.y^-oo  Uy  =  0  [4].  Thus,  when  xmin  and  ymin  are  suffi¬ 
ciently  small,  the  fractional  derivatives  in  (la)  become,  up  to  a  truncation  error, 
the  following  Caputo’s  type 


D?)TV 


Vxx{£,y,t) 

ra-(x-  00"1 


Vyy{xi£it)  jp\ 

/>•(!/ -o'3-1' V 


where  Pu  =  1/T(2  —  u).  In  what  following  we  will  omit  the  subscripts  xmin  and 
ym i„  in  the  above  derivative  representations.  Also,  for  any  (  =  (C1X2)  €  (0,  l]2, 
we  use  V*U  =  (D^1  U.  D^2U)T  to  denote  the  £-tli  order  gradient  operator,  where 
the  fractional  derivatives  are  of  the  Caputo  type. 


2  Solvability 

We  first  reformulate  (la)-(lb)  as  a  variational  problem,  and  then  show  that  the 
variational  problem  has  a  unique  solution.  Before  starting  this  discussion,  we 
introduce  some  function  spaces.  For  any  £  =  (Ci 5  C2)  and  Ci>C2  £  (0,1],  we  let 
H^(Q)  :=  {t;  :  v,  £  (L2(l?))2}  .  Define  |  •  |f  and  ||  •  ||f  by  |w||  =  ||V^|||2(fi) 
and  ||w||2  =  ||u||22(-fi)  +  |u|2.  Then  it  is  easy  to  show  that  |  ■  and  ||  •  ||^  are  semi¬ 
norm  and  norm  on  respectively.  It  has  been  shown  in  [7],  that  i7‘”(f?) 

equipped  with  ||  ■  ||^  is  a  Sobolev  space.  We  also  define  the  Sobolev  space  of  func¬ 
tions  the  homogeneous  boundary  trace  by  Uq(17)  =  {»:»£  H^(f2),v\en  =  0}. 

Without  loss  of  generality,  we  assume  that  Uo  defined  in  (lb)  satisfies  Uq  £ 
FT7  (17),  where  7  =  (cr,  (3).  Then,  under  the  transformation  V  =  U0  —  U,  (la)  can 
be  written  as  the  following  equation  with  boundary  and  payoff  conditions: 

CV  :=  -Vt  -  V  ■  (SVb-Dy  -  aV)  +  rV  =  /,  (2a) 

v  =  0  on  <917,  V  =  V*(x,y)  :=  U0(x,y,T)  -  U*(x,y),  (2b) 

where  a  =  (ai,a2)T,  B  =  diag(6i,&2)j  7— 1  :=  (a— 1,/3— 1),  and  f(x,y,t)  =  CUq. 
Using  the  notation  defined  above,  we  pose  the  following  problem: 

Problem  1.  Find  u(t)  £  H^2(f2),  such  that,  for  all  v  £  H^2(Q), 

+  A(u(t),v)  =  ( f(t),v ) 

almost  everywhere  (a.e)  in  (0,T)  satisfying  terminal  condition  (2b)  a.e.  in  17, 
where  A(u,v)  =  a  (Vu,  »)  +  (BV,7_1^,  \7v)+r(u,  v )  with  (•)  denoting  a  duality 
of  a  pair  of  dual  spaces. 
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It  is  easy  to  verify  that  Problem  1  is  the  variational  problem  of  (2a)-(2b)  (cf. 
[7]).  From  Lemma  2.1  in  [4],  we  have  shown  that  in  the  ID  case  A(-,  •)  is  coercive 
and  continuous.  Using  the  lemma  we  now  prove  that  A(-,  •)  is  also  coercive  and 
continuous,  as  given  in  the  following  lemma: 

Lemma  1.  There  exists  a  positive  constant  C,  such  that  for  any  v,w  £ 
Hg/2(f2),  and  t  £  (0,T)  a.e.  A(v,v)  >  C'||u||^2  and  A(v,w)  <  C'||u||7/2||w||7/2. 

The  proof  of  this  lemma,  based  on  Lemma  2.1  in  [4],  is  trivial  and  thus  omitted. 
Using  this  lemma,  we  have  the  following  result. 

Theorem  1.  There  exists  a  unique  solution  to  Problem  1. 

This  theorem  is  a  consequence  of  Lemma  1  and  Theorem  1.33  in  [9],  in  which  the 
unique  solvability  for  an  abstract  variational  inequality  problem  is  established. 
The  proof  to  Theorem  1  is  thus  omitted  here. 


3  Discretization 


Numerical  solution  of  standard  BS  equations  has  been  discussed  extensively  in 
the  open  literature  [11-14,19,21,23,24].  However,  there  is  a  very  limited  work 
available  on  the  numerical  solution  of  spatial  fBS  equations  [4,10].  Various  dis¬ 
cretization  schemes  have  been  developed  for  fractional  DEs  such  as  those  in 
[8,15-18].  In  this  section  we  will  present  a  2nd-order  scheme  for  (la),  based  on 
that  in  [4]  for  a  ID  fBS  equation. 

For  given  positive  integers  Mx  and  My ,  let  17  be  divided  into  rectangular 
meshes  with  nodes  (xj,  yj),  i  =  0, ..,  Mx,  j  =  0, . . . ,  Myi  where  Xi  =  xmm  +  ih\ 
and  yj  =  2/min  V  jh%  with  hi  =  (xmax  xTnjn )  f  Alx  and  /e2  =  (l/max  il-uun ) / At y  ■ 
For  a  positive  integer  N,  let  (0,  T)  be  divided  into  N  sub-intervals  with  the 
mesh  points  tn  —  T  —  nAt.  n  =  0, 1, . . . ,  IV,  where  At  =  T /N.  The  a-th  partial 
derivative  can  be  approximated  as  follows  [4]: 


DaxV{xuy3) 


7  —a 

QT/ 

T(2-a)f^o9k  l~k+l 


(3) 


for  any  *  £  {1,2,...,  Mx  —  1}  and  j  £  {1,2,...,  My  —  1},  where  Vi-k+i,j  is  an 
approximation  to  V(xi-k+i,yj,  t)  and  g%’ s  are  given  by,  for  k  =  3, 4,  ...,*  +  1, 

a  1  a  23_a  —  4  a  33-“-4x23-“  +  6 

50  _  (2  -  a)  (3  -  a)  ’  9l  ~  (2  -  a)(3  -a)’  92  (2  -  a)(3  -  a)  ’  W 

g%  =  gS [(k  +  1)3"“  -  4 k3~a  +  6 (jfc  -  1)3"“  -  4 (jfc  -  2)3"a  +  (jfe  -  3)3"“] .  (5) 

Lemma  2.  For  any  a  £  (1,  2),  the  coefficients  g k  =  0, 1, . . . ,  i  +  1  satisfy: 

(V  9o  >  0)  9?  <  0,  and  gl  >  0  for  k  =  3, 4, 5, . . . ,  i  +  1, 

(2)  there  exists  an  a*  £  (1,2)  such  that  gif  <  0  when  a  £  (l,a*)  and  gf  >  0 
when  a  £  (cc*,  2),  and 
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(3)  nt09k<o. 

The  proof  of  Lemma  2  can  be  found  in  [4].  Using  (3)  and  its  counterpart  for 
D$V{xi,Vj),  we  define  the  following  operators: 


(S^6^)U+  = 


f+i 


r( 2  -  a) 


,-0  J'+l 
2 


k—0 


n  2  -  /?) 


fc+i  I  ’ 


k—0 


Wj  =  ^(C"+1J  -  S,UT„  =  T((7Ji+1 


(6a) 

(6b) 


where  U^'l  denotes  an  approximation  to  U(xk,yi,tn).  Using  (6a)-(6b),  we  define 
the  following  scheme  for  (1): 


u; 


n+1 


-u; 


M  +0(a16xU?+1- 
+  (1-9)  ( ai5xU+  -  hSxU+ 


biS^f1  +  a25vU?+l  M^”+1  +  rU”+1) 

+  a2SxU+  -  b2S$U+  +  rU+)  =  0  (7a) 


for  i  =  1, . . . ,  Mx  —  1,  j  =  1, . . . ,  My  —  1,  and  n  =  0, . . . ,  iV  —  1  with  8  £  [0.5, 1] . 
The  boundary  and  payoff  conditions  are: 


U0  j  Uo(xo,yj,tn),  —  Uq(xmx  ;  Uj  >tn) ,  —  Uo(xi,  yo,tn),  (7b) 

U^My  =  U0 (xi ,  yMy  ,tn),  U%  =  U*  (Xi ,  y3 ,  TN)  (7c) 

for  all  feasible  ( i,j,n ),  To  rewrite  (7a)  into  a  matrix  form,  we  let 


IT 


—  (TTn  TTn  TTn  TTn  TTn  TTn  \ 

—  •  •  •  J  UMX- 1,1)  Ul,2l  •  •  •  )  UMX- 1,1)  •  •  ■  >  ^  1  ,  My  —1,1?  •  •  •  u  My-1,MX-1) 


Rearranging  (7a),  we  have 


(I  +  0M)  Un+1  =  (I  -  (1  -  0)M)  Un  +  {n+1~e,  (8) 

where  I  is  ( Mx  —  1 )  ( My  —  1)  dimensional  identity.  The  matrix  M  is  a  block 
matrix  which  has  (. My  —  1)  x  (My  —  1)  blocks,  and  the  size  of  each  block  matrix 
is  (Mx  -  1)  x  (Mx  -  1). 


A  +  Bi 

Bo 

0 

0 

b2 

A  +  B! 

Bo 

0 

b3 

b2 

A  +  Bi 

Bo 

Bm,-3 

b2 

A  +  Bi 

Bo 

0 

Bm,-2 

b3 

b2 

A  +  Bi 

Bo 

BMy- 1 

b3 

b2 

A  +  Bi 
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where 


Pi9o 

+  Vu 

j  =  i  +  1 

+?72)I  y. 

3  = 

0 

Pi9i 

+  §  At, 

j  =  i 

(M2  g{  +  r2At)iy, 

3  = 

1 

9i9i 

9i9k 

-  m, 

j  =  i  —  1  „ 

)  B?  =  S 

j  =  i  —  k  +  1 

(9292  -  in)ly, 
(929j)Iy, 

3  = 

3  = 

2 

3,4,. 

k  =  3,4, . . .  ,i 

My 

-  1 

0, 

otherwise 

0, 

otherwise 

—b\At  a±At 

=  r(2  —  a)hi’  m  =  2  hi 


_  —b2At 

“  r(2  -  /3)/if 


??2 


a2At 

~2h^' 


(10) 


and  I.y  is  the  ( Mx  —  1)  x  (Mx  —  1)  identity  matrix.  The  column  vector  fn+1-0  = 
(1  —  9)in  +  9 fn+1  is  the  contribution  from  the  boundary  conditions  (7b)-(7c), 
where  f"  and  fn+1  consist  of  contributions  of  boundary  values  at  tn  and  tn+\ 
respectively.  In  the  rest  of  this  paper,  we  choose  9  =  0.5  which  is  the  Crank- 
Nicolson  method  with  a  2nd-order  truncation  error. 

We  comment  that  though  the  discretization  method  is  developed  for  Euro¬ 
pean  option  pricing  problems,  the  principle  developed  is  applicable  to  com¬ 
plementarity  problems  involving  the  fractional  differential  operators  in  (la) 
governing  American  option  valuation  if  a  penalty  method  such  as  those  in 
[3,14,20,22,24]  is  used.  We  will  discuss  this  in  a  future  paper. 


4  Convergence  Analysis 

In  this  section,  we  show  that  the  solution  to  (7)  converges  to  the  viscosity  solution 
to  (1).  We  start  the  discussion  with  the  following  theorem: 

Theorem  2.  (Consistency)  The  finite  difference  scheme  for  (7a)  is  consistent 
with  a  truncation  error  of  order  0(At2  +  h\  +  h2)  when  9  =  0.5. 

Proof.  In  [4],  we  have  shown  that  the  finite  difference  scheme  for  the  deriva¬ 
tives  in  x  in  (6)  have  the  2nd-order  truncation  error  O(hf).  By  symmetry,  the 
finite  difference  schemes  in  y-direction  in  (6a)  and  (7a)  have  the  truncation  error 
0(h |).  It  is  also  known  that  the  Crank-Nicolson’s  scheme  used  in  (7)  has  the 
truncation  error  of  order  0(At2).  Therefore,  the  discretization  scheme  (7a)  has 
the  truncation  error  0(At2  +  hf  +  /i|). 

Theorem  3.  (Stability)  The  finite  difference  scheme  defined  by  (7)  is  uncondi¬ 
tionally  stable. 

Proof,  we  use  the  semi-discrete  Fourier  transform  to  prove  the  stability  of  the 
Crank-Nicolson  method  with  9  =  1/2.  From  the  definition,  we  see  that  all  the 
coefficient  matrices  in  (9)  are  Toeplitz  matrices.  Thus,  each  of  the  terms  in 
(9)  can  be  written  as  convolution  of  one  the  following  vectors  with  a  finite 


DISTRIBUTION  A.  Approved  for  public  release:  distribution  unlimited. 


A  2nd-0rder  FDM  for  a  2D  Fractional  BS  Equation 


51 


support  (•  •  •  ,  0,  (Ufc)T,  0,  •  •  •  )T  and  (•  •  •  ,  0,  (f™+1/2)T,  0,  •  •  •  )T  for  k  =  n  and 
n+ 1.  Applying  the  discrete  Fourier  transform  via  the  semidiscrete  Fourier  trans¬ 
form  pair  U?d  =  ^/^X^XeiKia‘+£aW)^"^^i^2  and  = 

h2hi  J2JL-oc  J2iL-oo  to  (8),  or  equivalently  replacing  [7+  and 

/”+1/2  With  Uke(itLihi+jti2h2)i  and  fn+l/2e(i^h1+j^h2)i  with  j  =  ^+1  for  aH 

admissible  i,  j  and  k  =  n,n  +  1,  we  obtain  a  system  in  Un+1.  Solving  the 
transformed  system  for  Un+1  we  have 


2- 


Un+1  =  - 


m  +  mi  Elio  9k e 


Ito  +  f]2  +  fi2  ^  g0ed-k)t2h2i  +  rAt 


Vi  +  Mi  Efcto  9k +  r?2  +  M2  Efcto  g^.e(-* 1~k^2h2 1  +  rAt 
2  At,fn+1/2 


Ur 


Mi  +  Mi  Elto  5fce^1_fe^1?lli  +  M2  +  M2  Efcto  gf  e(1“fc)«2ft.2i  +  rz\f 

where  Mi  =  Mi  (e^11  —  e_^l/tl1),  772  =  172  (e?2,t21  —  e-^21),  £1  £  [— 7r//ii,  w/hi], 
£ 2  £  [— 7r//i2, 7r//i2]  and  mi;M2>M1iM2  are  defined  in  (10).  Using  Euler’s  formula, 
we  rewrite  the  above  equality  as  follows. 


Tjn+l  _  1  -  [(^1  +  A2)  +  {Bi  +  B2)  1]  fTn _ At _ 

1  +  [(Ai  +  A2)  +  (Bi  +  B2)  i]  1  +  [(Ai  +  A2)  +  (Bx  +  B2)  i] 

where 


Ai  = 

Bi  = 


i+ 1 

y  9k  cos((l  -  k)£ihi)  + 


rAt 

T’ 


fc= 0 

771  sin(^ifei) 
2 


i+l 

+  Sill((1  _  fc)Cl^l), 


fc=0 


and  A2  and  B2  are  defined  by  replacing  the  superscript-subscript  pair  (a,  1)  with 
((3,2).  Taking  magnitudes  on  both  sides  of  the  above  equation,  we  have 


\un+1\  =  |Y 


'(1-  A)2  +  B2 

(1  +  A)2  +  B2 


At 


\fn+i\  .  (11) 

V(1  +  A)2  +  B2 


where  A  =  Ai  +  A2  and  B  =  Bi  +  B2.  We  now  show  that  <  1,  or 

A  >  0.  Omitting  the  superscripts,  we  have  from  Item  3  of  in  Lemma  2  that  —3  1  > 
Elto  fc^i  9k-  with  gk  >  0  when  k  >  3  for  all  i  >  3.  From  the  representations  of 
gk  in  (4)-(5),  we  have  that  50+32  >  0.  In  order  to  estimate  Ai  and  A2,  we  first 
derive  the  following  estimate 


i  +  l  i+l 

E)  gk  cos((l  —  k)£h)  =  go  cos (£,h)  +  31  cos  0  +  32  cos (— £ft)  +  gk  cos ((k  —  1  )£h) 

k= 0  fc=3 

i  +  l  i+l 

=  31  +  (go  +  32)  COS (£h)  +  9k  cos ((k  -  1  )(_h)  <  ^  gk  <  0. 
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Since  ^1,^2  <  0,  we  have  the  following  estimations 

y  cos ((1  -  k)fh  1)  +  ^  >  0,  cos ((1  -  k)£h2)  +  ^  >  0. 

k—0  k= 0 

Therefore,  Ai,A2  >  0  and  so  ^4  >  0.  Using  this  result  we  have  from  (11)  that, 
for  all  &  e  £],  *  =  1,2, 


U 


n+1 


< 


Ur 


<  ■■•  < 


Z\t 

f/° 


ra+l/2 


< 


[/' 


n— 1 


■^E|/fc+1/2 


fc=0 


< 


Z\f 

17° 


jn+l/2 

T 
f  TV 


jn- 1/2 

y^  |  yfc+i/2 


Using  Cauchy-Schwarz  inequality,  we  have 
2  nT2 


U' 


n+1 


<  c 


t/L 


AT2 


y  jfe+i/2 


fe=0 


<  C 


UL 


k= 0 


2  1 
+  N 


y  I  jfe+i/2 


fe=0 


for  any  n  <  iV  —  1,  where  C  denotes  a  generic  positive  constant,  inde¬ 
pendent  of  n  and  N,  Un+1,U°  and  fk+1/2  are  all  functions  of  G 
for  i  =  1,2.  For  any  continuous  function  W  on  17,  let  ||W||o,/t  = 
1^2 

(/12/11  X]y=i_1  Si=i_1  |W+|2)  denote  the  discrete  L2-norm  of  W.  Using 
the  properties  of  the  discrete  Fourier  and  its  inverse  transforms  (particularly 
Parseval’s  equality)  we  have 


l|uJ+1||L  = 


/*7r  /  h%  rrt/hi 


(^tt)  l/_7r//l2  J —TC/h1 


\Un+L\2d£  idfa 


< 


(2+ 


r»7r//i  2  rTt/hi 


'  —7c/h2  J  —ir/h\ 


|u0|+^6  + yE 


fTr/h2  r^/h\ 


N 


k= 0 


'  —  7r//i2  J  —  n/hi 


\fk+1/2\2d^df2 


=  C  ( ||U°||^  +  -  £  ||ffc+1/2||2,ft  )  <  C (||U°||2^  +  ||f|+)  , 


\  fc=0  / 

where  f  =  ((f°)T, . . . ,  (f2V)T)T.  Thus,  we  obtain  ||U:,+1||o,/i  <  C 

(||U0||o,/i  +  | |f | |oo) -  Therefore,  the  numerical  method  is  unconditionally  stable. 

We  now  show  that  the  numerical  scheme  is  monotone. 


Theorem  4.  (Monotonicity)  The  discretization  scheme  established  in  (7)  is 
monotone  when  At  <  2. 

Proof.  By  rearranging  the  discretizated  equation  (7a),  we  define  a  linear  function 
F('.:  of  Un+1  and  Un  as  follows: 

LiJ 
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r)  TTira+l  /rrn+l  7-rn+l  rrn+l  rrn+1  7-7-n+l  7-7-n+l  r  jn  TTn 

i,j  \+ i,j  ?  i—l,j  5  •  •  •  ?  ^ 0,j  >  *-'/i,j  +  l  5  '-J i,j  —  l  5  ■  *  •  ?  ?  ^-'i+l,j  t  '-+,j  ? 


Tjn  jjn  rrn  TJn  jjn  \ - 

*-+— l,j  j  •  •  •  ?  5  k'ijj+l  7  ^ i,j— 1  ?  *  •  *  ?  '-+,0/  • 


2  +  (miMi  +  M2  g{  +  rAt) 


n+1 


*+l 


jjv- 


+  (m  +  Miso )  ^"+ij  -  (m  -  Mis? )  £++  +  mi  9kU?-k+i,j  +  (m2  +  M2M0 ) 

fc=3 

3+1  r 

-  (jl2  ~  M252)  ui,t- 1  +  M2  9kUi,t-k+ 1  -  2  -  (/ngi  +  /r2fff  +  rAf) 

fc=3  ^ 

*+l 

+  (Mi  +  Mi5o)  C^i+ij  -  (Mi  -  Miff?)  +  Mi  y~! 9kUr~k+i,j 

k= 3 
i+i 

+  (172  +  M2S0)  ^."i+1  -  (m2  -  M2ff2)  +  M2  9kUZi-k+ 1- 

k=3 

We  also  define  the  following  two  functions: 

z^n+l  77171+ 1  / 7-7-n+l  rrn+1  rrn+l  rrn+l  rrn  rrn  rrn  \ 

-M^+e  —  Ui+l,j’  *  ‘  *  ’  U0J  ’  *  ‘  ‘  ’  ui,j+l  ’  ’  *  ‘  ’  ’  u*,0,/ 

^  (t^1,  +  e,  t++ij  ^+1  +  £, 

U^_1  +  e,...,U^  +  e), 

where  £  >  0.  It  has  been  proved  in  [4]  that  ^X]fc=o  Mfe)  —  |gf  >  0  for  i  = 

1,2, . . . ,  Mx  —  1.  This  inequality  also  holds  true  for  {</f}  with  i  +  1  and  Mx 
replaced  with  j  +  1  and  My  respectively.  We  now  use  this  result  to  prove  the 
monotonicity  of  i++1.  When  At  <  we  have  from  the  definition  of  t++1  that, 
for  any  £  >  0  and  feasible  i  and  j, 


r^n+l  _  cm+l  _ 

ri,j,—e  —  ri,j 


i+ 1 


1  -  7,  (miMi  +  M2M?  +  rAt) 


£  +  Mi(So  +S2)e 


j+i 


+  Mi  Mfc  £  +  M2  (flo  +52)£  +  M2^fff£ 

fc=3  fc=3 

<  ++  +  mi  £  +  M2  ~  +1  j  £-  ^  “  drZit)  £  -  F", 


in+1 
■3  ’ 


since  Mi;  M2  <  0.  Furthermore,  from  Lemma 2,  we  know  that  gf  <  0  and  gf  <  0, 
thus  we  have 


rpn+1  _  rpn+1  , 
ri,j,+e  ri,j  ' 


1+2  (mi.9?  +  M2flf  +  rAt^j 


£  >  M, 


n+1 


Therefore,  the  scheme  is  monotone. 

Combining  Theorems  2,  3  and  4,  we  have  the  following  convergence  result. 
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Theorem  5.  (Convergence)  Let  U  be  the  viscosity  solution  to  (1)  and  Uhlth2,At 
be  the  numerical  solution  to  (7)  with  spatial  and  time  mesh  size  triple 
(hi,  hi,  At).  Then,  Uhlth2,At  converges  to  U  as  (hi,  ft-2,  At)  — >  (0+,0+,0+). 

In  [1]  the  authors  show  that  any  finite  difference  scheme  for  a  general  nonlin¬ 
ear  2nd-order  PDE  which  is  locally  consistent,  stable  and  monotone  generates  a 
solution  converging  uniformly  on  a  compact  subset  of  (0,T)  x  i  to  the  unique 
viscosity  solution  of  the  PDE.  In  [5,6],  Cont  and  Tankov  extended  this  result  to 
partial  integro-differential  equations  (PIDEs).  Since  (la)  is  an  PIDE,  Theorems 
is  a  consequence  of  the  results  established  in  [1,5,6]  and  Theorems  2,  3  and  4. 

5  Numerical  Experiments 

We  now  apply  our  method  to  the  following  test  problem. 

Example  1.  Call-on-Min  and  Basket  options:  Eq.  (1),  with  the  system  and  mar¬ 
ket  parameters  a  =  0.25,  r  =  0.05,  K  =  50,  ai  =  =  0.384,  bi  =  b2  =  0.884, 
Zmin  =  2/min  =  ln(O.l),  xmax  =  ymax  =  ln(100)  and  T  =  1.  Initial  and  boundary 
conditions  can  be  obtained  by  setting  t  =  T,  x  =  sm;n,xmax  or  y  =  t/min,2/max 
in  the  following  functions. 

Call-on-Min  option:  U(x,y,t)  =  [: min(ex,ey )  —  JiTe  rtT*  *)]  * 

Basket  option:  U(x,y,t )  =  [(ex  +  ey)/2  —  Ke~r(-T^^]  +  . 

To  solve  this  problem,  we  choose  a  uniform  mesh  with  mesh  sizes  Ax  =  Ay  =  yb- 
and  At  =  ( ^ .  The  numerical  solutions  for  these  options  at  t  =  0  from  our 
method  are  plotted  in  Fig.  1  in  the  original  independent  variable  Sx  =  ex  and 
Sy  =  ev .  From  the  figures  we  see  that  these  numerical  solutions  are  qualitatively 
correct. 


Call-On-Min  Basket 

Fig.  1.  Computed  prices  of  Call-on-Min  and  Basket  options;  a  =  f3  =  1.5 
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To  see  the  influence  of  a  and  f3  on  the  option  prices,  we  solve  the  problem  for 
four  different  values  of  a  =  (3  =  1.3, 1.5, 1.7, 1.9  and  plot  the  differences  between 
the  numerical  solutions  of  the  standard  BS  equation  (i.e.,  a  =  (3  =  2)  and  the 
fractional  BS  equation  and  at  t  =  0  for  Call-on-Min  (Fig.  2)  and  Basket  Option 
(Fig.  3).  From  the  figures  we  see  that  the  Call-on-Min  and  Basket  options  from 
fBS  model  are  more  expensive  than  their  counterparts  of  the  standard  BS  model. 
From  these  figures,  we  also  see  that  the  call  prices  increase  as  a  decreases  when 
S\  and  S'2  are  greater  than  some  critical  values.  This  phenomenon  has  been 
observed  in  published  results  for  of  the  ID  fBS  equation  [2,4]  and  thus  our 
numerical  results  for  the  2D  problem  are  consistent  with  those  from  [2].  The 
figures  also  indicate  that  when  a  and  (3  approach  2,  the  numerical  solutions  to 
the  fBS  equation  approach  to  those  of  the  BS  equation. 


a,  p=1.7  a,  3  =  1.9 


Fig.  2.  Vbs  —  Vfbs  Call-on-Min  option 
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a, /3=  1.3 


a,  /3  =  1.5 


cT 


a,  13  =  1.7 


a,  13=  1.9 


Fig.  3.  Vbs  —  Vfbs  Basket  option 


6  Conclusion 

In  this  paper,  an  FDM  is  proposed  to  solve  the  2D  fractional  Black-Scholes 
equation.  The  discretization  method  is  shown  to  be  unconditionally  stable  and 
convergent.  Numerical  experiments  are  performed  to  demonstrate  the  usefulness 
of  the  methods  for  pricing  two-asset  European  options  of  practical  significance. 
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In  this  paper  we  propose  a  power  penalty  method  for  a  linear  complementarity  problem 
(LCP)  involving  a  fractional  partial  differential  operator  in  two  spatial  dimensions  arising  in 
pricing  American  options  on  two  underlying  assets  whose  prices  follow  two  independent 
geometric  Levy  processes.  We  first  approximate  the  LCP  by  a  nonlinear  2D  fractional  partial 
differential  equation  (fPDE)  with  a  penalty  term.  We  then  prove  that  the  solution  to  the 
fPDE  converges  to  that  of  the  LCP  in  a  Sobolev  norm  at  an  exponential  rate  depending  on 
the  parameters  used  in  the  penalty  term.  The  2D  fPDE  is  discretized  by  a  2nd-order  finite 
difference  method  in  space  and  Crank-Nicolson  method  in  time.  Numerical  experiments 
on  a  model  Basket  Option  pricing  problem  were  performed  to  demonstrate  the  convergent 
rates  and  the  effectiveness  of  the  penalty  method. 

©  2017  Elsevier  Inc.  All  rights  reserved. 


1.  Introduction 

Option  valuation  through  a  partial  differential  equation  approach  has  been  increasingly  attracting  much  attention  from 
financial  engineers,  mathematicians  and  statisticians,  ever  since  the  publication  of  the  two  seminal  papers  [4]  and  [20], 
In  [4]  the  authors  showed  that  in  a  complete  market  the  price  of  an  option  on  a  stock  whose  price  follows  a  geometric 
Brownian  motion  with  constant  drift  and  volatility  satisfies  a  second  order  parabolic  partial  differential  equation,  known 
as  the  Black-Scholes  (BS)  equation.  However,  Gaussian  shocks  used  in  BS  model  often  underestimate  the  probability  that 
stock  prices  usually  exhibit  large  movements  over  small  time  steps  which  can  be  demonstrated  by  empirical  financial  mar¬ 
ket  data.  When  jumps  are  large  and  rare,  a  jump-diffusion  pricing  model  can  be  used  to  capture  them.  More  details  of 
these  models  and  their  numerical  solutions  can  be  found  in,  for  example,  [1,2,13,30,31],  If  there  are  infinitely  many  jumps 
in  a  finite  time  interval,  an  infinite  activity  Levy  process  can  be  used  to  capture  both  frequent  small  and  rare  large  moves. 
It  has  been  shown  in  [6]  that,  the  price  of  an  option  on  a  single  asset  satisfies  a  ID  parabolic  fractional  Black-Scholes 
(fBS)  equation  when  its  underlying  asset  price  follows  a  geometric  Levy  process.  This  ID  fBS  equation  and  the  correspond¬ 
ing  American  option  pricing  problem  can  be  solved  numerically  by  the  numerical  methods  proposed  recently  by  us  in 
[7,8],  In  [10],  Clift  and  Forsyth  proposed  an  implicit  finite  difference  method  for  the  two  dimensional  parabolic  partial 
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integro-differential  equation  (PIDE)  to  price  two-asset  European  and  American  options  whose  assets  follow  the  correlated 
finite  activity  jump  diffusion  model. 

In  this  work,  we  shall  present  a  numerical  method  consisting  of  a  penalty  approach  and  a  discretization  scheme  for 
pricing  American  options  written  on  two  assets  whose  prices  follow  two  independent  geometric  Levy  processes.  Under  the 
same  assumptions  as  in  [6],  it  is  easy  to  show  that  the  value  of  such  a  two-asset  option  of  European  type  (eg.  Rainbow  or 
Basket  Option)  is  determined  by  a  2D  fBS  equation  and  the  value  of  the  corresponding  American  option  is  governed  by  a 
linear  complementarity  problem  involving  the  fractional  partial  differential  operator  used  in  the  European  option  model.  The 
latter  can  also  be  formulated  as  a  fractional  partial  differential  variational  inequality.  We  comment  that,  the  CGMY  jump- 
diffusion  process  [5]  is  also  popular  in  option  pricing.  An  fBS  equation  for  pricing  European  options  has  also  been  developed 
in  [6].  However,  in  the  present  work,  we  only  consider  the  fBS  equations  and  inequalities  associated  with  the  geometric 
Levy  process  and  will  develop  algorithms  for  the  fractional  differential  LCPs  based  on  the  CGMY  jump-diffusion  process  in 
a  future  paper. 

Penalty  approaches  have  been  used  very  successfully  for  solving  constrained  optimization  problems.  In  recent  years, 
penalty  methods  have  been  used  for  complementarity  or  variational  inequality  problems  in  both  finite  and  infinite  dimen¬ 
sions  [3,25,35],  particularly  those  from  the  valuation  of  financial  options  [15,18,19,22,27,33,34,36],  Recently,  modern  opti¬ 
mization  techniques  such  as  the  use  of  grossone  theory  proposed  in  [24]  in  nonlinear  programming  problems  with  differ¬ 
entiable  penalty  functions  to  determine  the  penalty  parameters  has  been  developed  in  [11].  In  [8],  we  proposed  a  power 
penalty  method  for  solving  the  fBS  equation  governing  single-asset  American  option  pricing.  In  this  paper,  we  construct  and 
analyze  a  power  penalty  method  for  the  fractional  differential  complementarity  problem  arising  in  pricing  the  aforemen¬ 
tioned  two-asset  American  options.  In  particular,  we  will  establish  a  convergence  theory  for  the  penalty  method  proposed. 
We  will  then  propose  a  2nd-order  accurate  scheme  for  the  discretization  of  the  2D  nonlinear  fBS  equation  in  two  spatial 
dimensions  generated  by  the  penalty  method,  based  on  our  recent  work  in  [7]  for  the  ID  fBS  equation  arising  in  pricing 
one-asset  options. 

While  the  numerical  solution  of  fractional  differential  LCPs  and  fBS  equations  arising  in  pricing  options  written  on  one 
risky  asset  has  been  discussed  in  various  existing  works,  to  our  best  knowledge,  there  are  no  numerical  methods  for  their 
2D  counterparts  governing  the  valuation  of  options  on  two  assets.  Therefore,  the  present  work  will  fill  this  gap  and  provide 
a  numerical  tool  for  pricing  European  and  American  options  of  the  aforementioned  type. 

The  organization  of  this  paper  is  as  follows.  In  the  next  section,  we  will  give  a  brief  account  of  the  fBS  equation  and  frac¬ 
tional  differential  LCP,  along  with  their  initial  and  boundary  conditions,  governing  the  valuation  of  European  and  American 
options  written  on  two  independent  risky  assets.  We  will  also  formulate  the  LCP  as  a  variational  inequality  and  show  that 
the  latter  problem  is  uniquely  solvable.  In  Section  3,  we  will  first  propose  the  power  penalty  method  with  positive  penalty 
parameters  X  >  1  and  k,  and  consider  the  unique  solvability  of  the  penalty  equation.  We  will  then  prove  that  the  solution  to 
the  penalty  equation  converges  to  that  of  the  variational  inequality  at  the  rate  0(X~k/2).  A  2nd-order  accurate  discretization 
scheme  is  proposed  in  Section  4  for  the  penalty  equation.  In  Section  5,  we  will  present  some  numerical  experimental  results 
using  an  American  Basket  option  pricing  problem  to  numerically  demonstrate  the  rates  of  convergence  and  usefulness  of  the 
numerical  method. 


2.  The  option  pricing  problem 


It  is  shown  in  [6]  that  the  value  of  an  option  whose  price  follows  a  geometric  Levy  process  is  governed  by  a  ID  fBS 
equation.  Under  the  same  assumptions  as  in  [6],  it  is  trivial  to  show  that  the  value  U  of  a  European  option  written  on 
two  assets  (eg.  Rainbow  or  Basket  Option)  whose  prices  S}  and  S2  follow  two  independent  geometric  Levy  processes  is 
determined  by  the  following  two-dimensional  fBS  equation: 

CU  :=  -Ut  +  a,Ux  +  a2Uy  -  t>i[_ooD“U]  -  b2[^U]  +  rU  =  0  (la) 


for  (x,y,  t)  e  (-oo,  oo)2  x  [0,  T ),  where  x  =  lnS^  y  =  lnS2,  -<x>D%U  and  -oaDyU  denote  respectively  the  ctth  and  /1th  deriva¬ 
tives  of  U  in  x  and  y  for  a,  e  (1,  2),  T  >  0  is  the  expiry  date,  r  >  0  is  the  risk-free  rate,  a  >  0  is  the  volatility  of  the 
underlying  asset  price,  and 


tq  =  -r  -  -er  sec 


a2 


m 

bt  = 

cr“  sec  ^ 

(tL\ 

— 

1  R 

_ rr  >  cpr 

(p* 

V  2  j 

7  u2  — 

2 

(  2 

Boundary  and  terminal  conditions  can  be  defined  for  the  above  equation  depending  on  the  types  of  options  and  the  strike 
price  K.  From  the  transformations  x  =  lnS]  and  y  =  lnS2,  we  have 

lim  Ux=  Iim  Us.ex  =  0,  lim  Uv  =  lim  USley  =  0, 

x— oo  x— ► — oo  y->- oo  y-K—oo 

since  USj  and  US2  are  bounded  as  S1,S2-»0+  in  practice.  In  computation,  the  infinite  solution  domain  (-oo.oo)2 
has  to  be  truncated  into  £2  =  (xmin,xmax)x  (ymin,ymax),  where  xmin,  xmaXl  ymin  and  ymax  are  four  constants  satisfying 
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xmin,  ym in  <  0  and  Xmax.  ymax  >  0.  Therefore,  since  both  Ux  and  Uy  go  to  zero  exponentially  as  x  and  y  approach  — oo, 


when  xmin,  ymin  <  0,  the  following  conditions  for  (la),  up  to  a  truncation  error,  are  satisfied: 

Ux(x,y,t)  =  0,  x<xmin,  (y.t)  g  (ymin,ymax)  x  [0,  T),  (lb) 

Uy(x,y,t)  =  0,  y  <ymin,  (x,t)  g  (xmin,xmax)  x  [0,  T).  (lc) 

Clearly,  for  put  options,  the  strike  price  K  should  satisfy  max(e*min,  eymin)  <  K  <  min(e*max,  eJ'max). 

As  in  the  conventional  case,  the  two-asset  American  option  price  satisfies  the  following  fractional  differential  linear  com¬ 
plementarity  problem 

CU  >  0,  U  >  U *,  (2a) 

CU  ■  (U  —  U*)  =  0,  (2b) 


where  U*  is  a  given  function  of  (x,  y,  t)  defining  a  ‘lower  bound’  on  the  solution  which  is  usually  the  payoff  function  of 
the  pricing  problem.  Note  that  (2a)  and  (2b)  contain  (la)  as  the  special  case  when  U*  <  0.  This  is  because  U  >  0  and  thus 
the  2nd  inequality  in  (2a)  is  always  satisfied  if  U*  <  0.  In  this  case,  the  complementarity  condition  (2b)  yields  (la).  In  what 
follows,  we  only  consider  pricing  American  puts  as  the  price  of  an  American  call  is  equal  to  that  of  its  European  counterpart. 
For  brevity,  we  assume  in  the  rest  of  this  work  that  U*  is  the  payoff  function  of  the  problem. 

On  the  boundary  of  £2,  we  impose  the  following  boundary  and  terminal  conditions  for  an  American  put: 

U(Xmin,y,t)  =  gl(y,t),  U(X,ymin,t)  =g2(x,t),  U(xmax,  y,  t)  =  0  =  U(X,  ymax,  t)  (2c) 

for  (x,  y,  t)  g  £2  x  [0,  T),  and 

U(x,y,T)  =U*(x,y),  (x,y)  g  12,  (2d) 

where  gj ,  g2  are  given  functions  and  U*  is  the  payoff  of  the  option  defined  below. 

For  brevity,  we  only  consider  two-asset  American  Basket  options  in  this  work  whose  pay-off  function  is 

U*(x,y)  =  [/<'- wqe*  -  w2ey]+,  (3) 

where  [z]+  =  max{z.  0}  and  w1:  w2  >  0  are  weights.  Clearly,  the  weights  are  arbitrary  as  long  as  (2c)  and  (2d)  are  consistent. 
We  also  assume  that  the  computational  domain  £2  is  sufficiently  large  so  that  K  -  w*e  -  w*e  =  0  is  a  curve  in  the  interior 
of  £2. 

It  is  normally  not  possible  to  derive  explicit  analytical  expressions  for  the  boundary  conditions  g]  and  g2  in  this  case,  as 
they  are  usually  the  solutions  of  one-dimensional  American  option  pricing  problems  or  LCPs  of  the  form  (2a)  and  (2b).  In 
practice,  numerical  approximations  to  these  ID  American  option  pricing  problem  are  sought  as  discussed  in  16,17], 

To  determine  glt  one  needs  to  solve  a  ID  LCP  obtained  by  taking  the  limit  of  (2)  asx->  xmin.  Using  (lb),  we  see  that  g ^ 
should  satisfy  the  following  ID  LCP: 

-git  +  °2giy  -  b2[-°°Dy St  1  +  rgi  >  0. 

■  gi  >  LD(xmin,y),  (4) 

(-git  fiazgiy-fazf-ooDygi]  +rgi)  •  (g,  -U*(xmin,y))  =  0, 

with  the  boundary  and  terminal  conditions 

gi(ymin,t)  =  U*(xmin,ymin),  gi(ymax,t)  =  0,  gi  (y,  T)  =  u*(xmin,y).  (5) 

According  to  29],  the  upper  bound  of  the  asset  prices  are  usually  three  to  four  times  the  strike  price.  Choosing  a  reasonable 

large  upper  bound,  we  can  have  the  above  artificial  boundary  conditions  at  (ymax ,  t). 

Similarly,  g2(x,  t)  is  determined  by  the  following  LCP: 

I— g2t  +  a2g2y  -  bi  [_ooD“  g2  ]  +  rg2  >  0, 

g2  >  U*(x,ymin),  (6) 

(-g2t  +  a2g2y  -  bi[-ooD“g2]  +  rg2)  ■  (g2  -  U*(x,ymin))  =  0. 
with  the  boundary  and  terminal  conditions: 

g2(xmin.f)  =  U*(xmin,ymin),  g2(xmax,  t)  =  0,  g2(x,  T)  =  U*(x,ymi„).  (7) 

Both  (4)  and  (6)  are  single-asset  American  option  pricing  problems  with  fractional  Black-Scholes  operators.  Note  that 
the  boundary  and  payoff  conditions  (5)  and  (7)  are  exact.  The  above  ID  problems  can  be  solved  numerically  using  the 
discretization  and  penalty  methods  proposed  in  [7,8]  to  yield  approximations  to  gj  and  g2.  The  computational  errors  in  the 

numerical  solutions  of  the  boundary  conditions  are  of  the  order  0(h2  +  At2  +  7.fe/2)  as  proved  in  [7,8],  where  h  and  At  are 

respectively  the  maximal  mesh  sizes  in  space  and  time,  and  X  >  1  and  k  >  0  are  the  penalty  parameter  and  power  used  in 
the  power  penalty  method. 
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There  are  various  representations  of  the  fractional  derivative  _ooD"L/(x,  y)  such  as  those  of  Riemann-Liouville  and 
Griiwald-Letnikov  [21,23].  For  a  given  x0,  one  form  for  XoD%V(x,y)  is 

U(x0,y,t)  ,  Ux(x0,y,t) 


XoDaxU(x,y,t)  = 


l -a)  L 


xUxx(^,y ,  t)Ah 


r(l-a)(x-x0)a  T(2  -o')(x-x0)“-'  F(2  —  < 

for  x  >  x0,  where  T(  )  denotes  the  Gamma  function.  Using  (lb)  it  is  easily  seen  that,  for  x0  <  xmin,  (8)  reduces  to 
0 Cornin'  0 


(8) 


x0DxU(x,y,  t)  = 


^ L 


rn  -a)(x-x0)a  r(2—i 

since  Ux(x,y,  t)  =  0  and  U(x0,y,  t)  =  U(xmjn,y,  t)  when  x  <  xmin  (up  to  a  truncation  error).  Therefore,  we  have 

r(2-< 


_oo  DxU(x,y,t)  =  lim 

Xq— C 

: 

 nT- 


xo  *  oo  I  r (1  —  a)(x  -  x0) 

—  [ 

-a)  JXl 


(x-H) 

min  (uf 

!  -  a)  Jx, 


x  Uxx($,y:  t)  A>_ 


(9) 


(*-?)“ 

for  x  >  xmjn.  This  is  Caputo’s  representation  of  the  ath  derivative  of  our  solution  U  with  respect  to  x.  Similarly,  using  (lc), 
we  can  derive,  for  y  >  ymin  and  up  to  a  truncation  error, 

ry  Uyy(x,^t). 


-<xDyll(x,y,  t)  = 


—  f 


rv-P)Jy^(y-W 


2.1.  The  variational  formulation  and  unique  solvability 


In  this  section,  we  first  formulate  (2)  as  a  variational  inequality  problem  and  then  show  that  the  problem  has  a  unique 
solution.  We  start  this  discussion  by  introducing  some  function  spaces. 

For  the  open  set  £2  c  R2  and  1  <  p  <  o o,  we  let  LP(£ 2)  =  { v  :  (fQ  \v\pdd)'l/p  <  oo}  denote  the  space  of  all  p-power  inte- 
grable  functions  on  £2  equipped  with  the  usual  Lp-norm  |[  ■  ||lp(S2)-  We  use  (•,  •)  to  denote  the  usual  inner  product.  For  any 
K  =  [Ci.?2]6  (0,  l]2.  we  let 

Hf(R2)  \={v:v,  -cx;Dx  vand_xD^v  e  L2 (M2 ) } . 


On  (R2)  we  introduce  an  energy  norm 


K  = 


Hl2(e2)  ‘ 


n^if  II2 

-ooux  i/IIl2(R2)  ‘ 


•  ||^  such  that  for  any  v  e  (1R2), 

II  D^v  II 2 
11-00  y  ^IIl2(R2)- 


(10) 


It  has  been  shown  in  [12]  that  (R2)  equipped  with  ||  ■  ||^  is  a  Sobolev  space. 

Similarly  to  (R2),  we  also  define  the  Sobolev  space  of  functions  having  a  support  on  £2  =  (xmin,  xm;ix)  x  OTnin.ymax) 
given  by 


H0f(£2)  =  {p:peHf(£2),p|8S2  =  0} 


with  the  energy  norm  defined  in  (10)  (with  R2  replaced  with  £2),  where  xmjnDx1  u  and  yminDy2u  are  defined  in  (8)  with  x0 
and  y0  replaced  with  xmin  and  ymin  respectively  and  3  £2  denotes  the  boundary  of  £2.  In  what  follows,  we  also  use  (■,  ■) 
to  denote  the  duality  pairing  between  H^(£2)  and  its  dual  space  (i 2)  defined  by  (v,  w)  =  fnvwdQ  for  ve  H^(£ 2)  and 

w  e  HQ  f  (£2). 

We  first  rewrite  the  operator  in  (la)  as  the  following  conservative  form: 

CU  =  —Ut  —  V  ■  (~aU  +  BV(^U)  +  rU, 

where 


Letting  y  =  [a,  /3]  and  f  =  [a  —  1,  ji  —  1],  we  define 


V(<>U  = 


3“-i U  dP-'U 
ax"-1  '  dyP~^ 


Let  U0  e  H2(£ 2)  be  a  known  function  satisfying  the  boundary  conditions  (2c).  (For  example,  U0  can  be  the  solution  of 
a  bi-harmonic  equation  satisfying  (2c)  and  the  homogeneous  Neumann  boundary  condition.)  Then  we  introduce  a  new 
function 


u(x,y,  t)  =  U0(x,y)  -  U(x,y,  t). 


(11) 
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Taking  £U0  away  from  both  sides  of  (2a)  and  using  (11),  we  have 
f  Cu<f. 

|  u  <  u*.  (12a) 

l  CCu  -  /)  (u-u*)  =  0 

for  feasible  f  and  (x,  y)  with  the  boundary  and  terminal  conditions: 

u(x,y,t)|8n  =  0,  u(x,y,T)  =  u*(x,y),  (12b) 

where  f(x,y)  =  CU0(x.y)  and  u*(x,y)  =  U0(x,y)  -  U*(x,y). 

We  now  define 

K,  =  jv(t)  :  v(t)  e  H^/2  (Cl).v(t)  <  u*(t)  almost  everywhere  in  (0,  T)}. 

It  is  easy  to  verify  /C  is  a  convex  and  closed  subset  of  Using  this  convex  set,  we  pose  the  following  problem: 

Problem  2.1.  Find  ueK,  such  that,  for  all  v  e  K,, 

3“  ' .  (.3) 


-gpV-  u)+A(u,  v-  u)  >  (f,  v-u). 


almost  everywhere  (a.e.)  in  (0,  T),  satisfying  the  boundary  and  terminal  condition  (12b),  where  /!(■,  ■)  is  a  bilinear  form 
defined  by: 

A(u,v)  =  (Vu.av)  +  (BV(^u,Vv)  +  r(u,v),  u,i/eHj/2(fi).  (14) 

it  can  be  easily  shown  that  Problem  2.1  is  the  variational  form  of  (12a). 

In  [7]  (also  [12]),  we  have  proved  the  following  lemma. 

1.  For  A(u.v)  =  ai3u  ”v  1  h/  Sv 

i;,weHj/2(l),  a  e  (1,  2). 

A(v,  f)  >C,||p||2/2, 

A(v,  w)  <  C2|[v||a/2|[w|[a/2, 
for  all  t  e  (0,  T)  a.e. 

Using  this  lemma,  we  can  derive  the  following  lemma. 

Lemma  2.3.  There  exist  two  positive  constants  Cf  and  C2.  such  that  for  any  v,  w  e  H^/2(£2), 

A(v,v)>q\\v\\2y/2,  (17) 

A(v,w)  <C|||v||y/2||w||x/2,  (18) 

for  t  e  (0,  T)  a.e. 

Proof.  Let  C  be  a  generic  positive  constant.  Using  (15)  and  Cauchy-Schwarz  inequality,  we  have,  for  u.v  e  H^/2(C2), 


Lemma  2.2.  For  A(u.v)  =  a(^.v)  +  b(XminDx  ’u,  If)  +  r(u,  v),  there  exist  positive  constants  C\  and  C2,  such  that  for  any 


(15) 

(16) 


dv 


dv 


A(v.v)  =  a,(^z,v)  +  bt(Xn]iDax-]v.  _ )  +  a2(  p)  +  b2(ymmDpy  v,^)  +  r(v,  v) 


dv 


3  y' 


dv 


dx’ 

>  C,  ||v||S/2 +C2||v|||/2 

i  C(||v||„/2  +  I|p|| ft /l) 

>  C||p|[2/2. 

Similarly,  using  (16)  and  Cauchy-Schwarz  inequality,  we  can  have 


dy 


dv 


dw 


dx 


A(v.  w)  =  ai(  w)  +  fa,(XmmDr'p,  )  +  a2[  w)  +  b2(ymmDp-  v,^)  +  r(v,  w ) 


dv 


dy’ 


dw 


dy 


S  Cl  ||u||a/2  ||w||a/2  +C2||p||^/2llwll^/2 


<  C 


2 

a/2  ' 

1 2  I 

<y/  2 1 


iii,II|/2)1/2(II  w"2 


v  1/2 


a/2  ' 


M/2/ 


II 2 

lly/2- 


□ 


Using  Lemmas  2.2  and  2.3,  we  are  able  to  prove  the  following  theorem. 
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Theorem  2.4.  There  exists  a  unique  solution  to  Problem  2.1. 

This  theorem  is  just  a  consequence  of  Lemma  2.3  and  Theorem  1.33  in  [14],  in  which  the  unique  solvability  for  an 
abstract  variational  inequality  problem  is  established.  Thus,  the  proof  is  omitted  here. 

To  conclude  this  section,  we  comment  that  the  transformation  (11)  is  necessary  only  for  theoretical  discussions.  It  is  not 
necessary  to  use  (11)  in  practical  computations. 


3.  Penalty  method  and  convergence 


Penalty  methods  have  been  used  successfully  for  solving  conventional  constrained  optimization  problems.  In  this  section 
we  will  propose  such  a  penalty  method  for  (12a)  and  (12b).  We  then  establish  a  convergence  theory  for  the  penalty  method. 
The  penalized  equation  to  solve  American-style  option  pricing  problem  is  given  below: 

Cux(x.y,  t)  +  A.[uA(x,y,  t)  -  u*(x,y)]yk  = /(x,y),  ( x.y.t )  e  £2  x  (0,  T)  (19a) 

satisfying  the  following  boundary  and  terminal  conditions: 

UA.(x.y,t)|9£2  =  0,  ux(x,y.  T)  =  u*(x,y),  (19b) 

where  X  >  1  and  k  >  0  are  penalty  parameters.  The  variational  form  of  (19)  is  as  follows. 

Problem  3.1.  Find  ux(t)  e  H^/2(£2)  satisfying  the  initial  condition  in  (19b),  such  that,  for  all  v  e  H^/2(C2), 

(-^t p--vj+AM),  v)  +  (X[Ui(t)  “  “if.  v)  =  (/.  v)  (20) 

for  t  e  (0,  T)  a.e.,  where  A(-,  ■)  is  a  bilinear  form  defined  in  (14). 

Theorem  3.2.  Problem  3.1  has  a  unique  solution. 

Proof.  To  prove  this  theorem,  it  suffices  to  show  that  the  nonlinear  operator  on  the  LHS  of  (20)  is  strongly  monotone  and 
continuous.  Since  the  linear  part  A  of  the  LHS  of  (20)  is  coercive  by  (17)  and  the  nonlinear  penalty  term  in  (20)  is  clearly 
monotone,  the  operator  is  strongly  monotone. 

From  (18)  we  see  that  A(ux.v)  is  Lipschitz  continuous  in  both  ux  and  v.  Also,  it  is  obvious  that  the  nonlinear  term  is 
continuous  in  both  uA  and  v.  Therefore,  Problem  3.1  is  uniquely  solvable  by  the  standard  result  in  [14,  p.  37],  For  a  more 
rigorous  proof  of  this  theorem,  we  refer  to  Theorem  3.2  of  [8],  □ 


We  now  show  that  the  solution  to  Problem  3.1  converges  to  that  of  (12a)  as  the  penalty  parameters  X  or/and  k  -*  00  in 
a  proper  norm.  Before  further  discussion,  it  is  necessary  to  introduce  the  usual  Hilbert  space  in  space  and  time  given  by 

Lp( 0,  T ;  H(Q))  :=  {v(-.  ■;  t)  :  i/(.,  •;  t)  e  H(C2)  a.e.  in  (0,  T)  |  |u(,  •;  t)||H(n)  e  Lp(( 0,  T))} 

with  the  norm 

fT  \  VP 


IK-.  •; f)  |[lp(o, =  (J^  ll“(-.  •; 


where  H(f2)  denotes  a  Hilbert  space  on  with  the  norm  ||  ■  ||h(s2)-  Using  this  space  we  present  the  following  lemma. 

Lemma  3.3.  Let  ux  be  the  solution  to  Problem  3.1  and  assume  that  u x  6  Lp(f 2  x  (0,  T)),  where  p  =  1  +  1/fc.  Then  there  exists  a 
positive  constant  C,  independent  of  ux  and  X,  such  that 


II  [“a  U*]+lllP(nx(0,T))  -  JJ’ 

C 

l|[“A_U  1+  IIl“.(0,T;Z.2(Q))  II  [“A  “  ]+lll.2(0,7-;H>72(Q))  -  Vk/2  ' 


(21) 


(22) 


HO.T.H^(t2))  -  ^k/2' 

Proof.  Let  C  be  a  generic  positive  constant,  independent  of  ux  and  X.  To  simplify  notation,  we  put  4>(x.y,  t)  =  [u^(x,y,  t)  — 
u*(x,y)]+.  It  is  easy  to  see  that  </>(•,  ■,  t)  e  H^/2(Q)  for  t  e  (0,  T)  a.e.  Thus,  setting  v  =  cp  in  (20),  we  have 


=  (/-0)  a-e- in  (°'T)- 
Taking  — (3j£,  <p}  +  A(u*,  cp)  away  from  both  sides  of  the  above  equality  gives 


d(ux-u*) 
dt 


.(p)+A(ux  -u’,0)  +  A(01/fe,0)  =  (f,<p)  +  [-^r.(p)  -A(u*,(p), 


du* 
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(-^.0|  +  ^(0.0)+^(01A.0)  =  (/,0)  -A(U*,0), 


since  0  =  0  when  ux  —  u*  <  0  and  =  0. 

Note  0(x,y,  T)  =  [u^(x,y,  T)  —  u*(x,y)]+  =  0  by  (19b).  Integrating  by  parts  gives 


l  ^  |-0( z), 

vhich 

r( 


30(0 

3 1 


dr 


from  which,  we  get 

-■^pl,0(r)|dr  =  i(0(t),  0(0)- 

Integrating  (23)  from  t  to  T  and  using  (24),  (15)  and  Holder  Inequality  ,  we  obtain 

^rnur  / 


1(0(0,  0(0  )  +  cft  U(z)\\2y/2dT+xjt  ||0(r)|| 

J  (/(t),0(t))dr A(u*,<p(z))dz 
C(l  l^(T)Hw(£2)dT^  ~  l  A(u* ,  0(r))dr. 


p  dr 


From  the  definition  of  /!(■,  ■)  in  (14),  we  see  that  the  integrand  of  the  last  term  in  (25)  is 
—A(u*,  0(r))  =  (au*  +  BV(^u*,  V0)  +  r(u*,  0). 

By  Green’s  theorem,  we  have 

-  /  (au*,V0)dr=  /  /  V  ■  (au*)cf>(x,y,  z)dxdydz  -  /  /  (au*  ■  n)(f>(x,y,  z)dxdydz 

Jt  Jt  Jn  Jt  Jan 

(  [  dxdycp(x,y,  z)dz  <  Cl  f  ||0(i 

Jt  Jci  \Jt 


<  c 


llinn)1- 


i/p 


(23) 


(24) 


(25) 


because  LI*  and  V  ■  (au*)  are  both  bounded  on  L2,  where  n  denotes  the  unit  vector  outward-normal  to  3L2. 

Let  =  {(x.y)  e  L2  :  K  -  -  w2ey  >  0}  and  L22  =  L2  \  such  that  U*(x,y )  =  0  on  L22.  We  also  let  r0  be  the  inter¬ 
face  of  and  L22  so  that  r0  has  two  opposite  orientations:  which  is  oriented  in  the  same  direction  as  3L2i,  and  r~ 

which  is  oriented  in  the  same  direction  as  3L22.  Since  0  =  0  on  r0,  we  have,  using  integration  by  parts, 


-(BV(^u*,  V0) 


-  f  (BV(f ,u*)rV0dxdy  -  f  (BV(f,u,t)rV0dxdy 

J  fi-j  J  ^2 

[  V  •  (B'V^)u*)(pdxdy  -  I  BS/^^u*  -ncpds 
in,  ir+ 

+  f  V  ■  (BV(f  *u*)0dxdy  -  f  BV(^u*  ■  ncpds 

J  ^2  J  fg 

-  [  BCV^'iu^  - ■  rupds +  ’S~'  [  V  ■  (BV<? )u*)0dxdy, 

Jr*  £i  in, 


(26) 


where  n  is  the  unit  outward  normal  direction  of  the  boundary  and  VuL  and  Vu*  denote  the  value  of  Vu*  on  the  left  and 
right  sides  of  T+  respectively.  Since  u*  =U0-  LI*, 

V<Ou*±  =  V<Ou0±-V((>U*. 

Note  that  U0  e  H2(fi),  V^LI0  is  continuous  in  L2.  From  this  and  (3)  we  have 
V<G>u*  -  V(^u*  =  V(t:)U;  -  V(t>U!  =  -(w:ex,w2ey)T, 
since  V^U*  =  0.  Since  T0  is  characterized  by  K  -  w-^e*  -  w2ey  =  0.  the  unit  vector  outward-normal  to  is 

V (I<  -  Wje*  -  w2ey)  (— Wje*,  -w2ey)r 

||  V  (/C  —  Wie*  —  w2e^)||  (w^e2x  +  w^e2^)1/2  ’ 

Based  on  the  above  results,  (26)  has  the  following  upper  bound: 
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-(BV«  VV0)  <  -  f  (^e\w2ey)TB(w^,W2 
k  Jr*  (w*e2x  +  w\e2yyi2 

Z  [  ipdxdy, 

Jn 


B  is  positive-definite,  (f>  is  non-negative  and  V  ■  (BV^u*)  is  bound 
fore,  replacing  the  last  term  in  (25)  by  the  above  upper  bound  gives 


since  B  is  positive-d 
Therefore,  replacing  _ 

i(0(t),0(t))  +  Cjf  \\<p(T)\\2y/2dz - 

for  all  t  e  (0,  T)  a.e.  This  implies  that 

*jf  M(T)\\lwdT  <C^r||0(r)||  tP(£2)^TJ 


is  bounded  above  on  both 

inrl  cri'w’pc 


'T  U{x)\\l(a)dx 


l/P 


and  0.2  from  its  definition. 

(27) 


\  i/p 

P  ,*)  • 


and  so 


(jt  H0(r)lliPP(n)dT 


i-i  /p 

<  Ck  l . 


From  the  choice  of  p  we  see  that  1  -  1/p  =  l/(kp).  Thus,  from  the  above  estimate  we  have 


This  is  (21).  Combining  (27)  and  the  above  estimate  yields 
^(0(t),0(t))  +  ^  ||0(r)||2/2dr  <  ^ 

for  any  feasible  t.  Finally,  noting  that  t  is  arbitrary,  the  above  inequality  implies  (22).  □ 

Using  Lemma  3.3,  we  are  able  to  prove  the  following  theorem. 

Theorem  3.4.  Let  u  and  ux  be  the  solutions  to  Problems  2.1  and  3.1,  respectively.  If  e  L1+,<(f2  x  (0,  T)),  then  there  exists  a 
constant  C  >  0,  independent  of  X,  such  that 

C 

II  -  Ullt“(0,T;t2(S2))  +  II  “A.  -  Ulli.2(0,T;H>'/2(£2))  -  Jk/2’  (28) 

where  X  and  k  are  the  parameters  used  in  ( 19a ). 

Proof.  Following  the  notation  used  in  the  proof  of  Lemma  3.3,  we  decompose  u  -  ux  as 

u-ux  =  u-u*  +  [ux-  u*]_  -  [ux  -  u*]+  =:  Rx  -  tj>,  (29) 

where  [z]_  =  -  min{z,  0}  for  any  z  and 

Rx  =  u  -  u*  +  [14  -  u*]_.  (30) 


Let  us  first  consider  Rx.  Setting  v  =  u  —  Rx  in  (13)  and  v  =  Rx  in  (20)  gives 

l-~,-R,}l  +  A(u,-Rx)>(.f,-Rx). 

l-^,Rx\+A(Ux,  Rx)  +  X(^k.  Rx)  =  (/.  Rx). 

Adding  up  the  above  inequality  and  equality,  we  have 

(_  8(Ug  ~  — ,  Rx)  +Mux  -  u.  Rx)  +  X{^k.  Rx)  >  0.  (31) 

From  their  definitions,  it  is  easy  to  see 

4>Vk[Ux  ~  «*]-  =  [u;.  -  u*]]!k[ux  -  u*]-  =  0.  (32) 

Thus,  using  the  above  relationship  and  (30),  we  have 

Rx )  =  (01/,c,  u  -  u*  +  [ux  -  u*]_)  =  u  —  <  0, 
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since  0  >  0  and  u  -  u*  <  0  by  (12a).  Therefore,  (31)  reduces  to 
d(u-ux) 


dt 


,Rx)+A(u-ux,Rx)  <0. 


Using  (29),  it  is  easy  to  see  that  the  above  inequality  can  be  rewritten  as 


dRx 

dt 


,  Rx  )+A(Rx,  Rx)  <  (  -M,  Rx  >  +4(0,  Rx). 


From  (30)  we  see  that  Rx(x,T)  =  0.  Thus,  integrating  both  sides  of  the  above  estimate  from  f  to  T  and  using  the  same 
argument  as  for  (24),  Cauchy-Schwarz  inequality  and  (18),  we  have 

rT 


l(Rx(t),Rx(t))  + ^  A(Rx(t),Rx(t))c1t 
~  1  V  ]  ■  R^(zI)dT  + 1  M<t>(r),Rx(r))dr 


m)<RX(t))+  n<Kx). 


dRx( r) 
3r 


dr  ■ 


-f  A(<p(x),  Rx(r))dz 


lli“(0,T;t2(n))ll^  llt“(0,r;L2(n))  +  ^'ll<^lll2(0,r:Hj/2(t2))ll^llL2(0,r;H»'/2(n)) 


(33) 


for  all  t  e  (0,  T).  Using  (32),  (30),  and  (21),  we  estimate  the  last  term  in  (33)  as  follows: 


I  9^r))dT  =  [  ^jP~)dx  -  CII^Htr(S2x(o,r)) 


du 

dt 


Xk’ 


L<?(fix(o,r)) 

where  p  =  1  +  \/k  and  q  =  1  +  k.  Substituting  the  above  upper  bound  into  (33)  and  using  (16),  (15)  and  (22),  we  obtain 
(||^a|Il~(0,T;I2(£2))  +  l|KJi.2(o 
<c(\\m2L  °°(0,T;L2(C2))  +  II^IIl22(o  ,T;HY/2(Q))^ 

—  c(||0||L«,(OiT;12(Q))||R^||1K,(O,r;L2(£2))  +  H0lll2(O,T;Hj/2(n))  II^A,  IIl2(0,T;H’(/2(£2))  +  ^ 

—  ^[(  II  ^  llt.“(0,T;L2(S2))  +  ll</’llt2(0,T;Hl'/2(n)))  '  (  II  1 1  (0,T;L2  (£2))  +  II^X  lli2(P,r;H>'/2(£2)))  +  ^  *] 

<  C[A  k/2(||^A.I|y(o,T;t!(n))  +  II^Alli2(o,r;H)'/2(S2)))  +  ^  *]• 

This  is  of  the  form  p2  <  C(pX~k/2  +  X~k).  It  is  easy  to  prove  that  p  <  CX~k/ 2  for  a  generic  positive  constant  C,  independent 
of  X  and  k.  Therefore,  we  have 

II^Alli»(o,r;i2(S2))  +  ll^lli2(o,r;H>-/2(S2))  5  CA  k/2-  (34) 

Finally,  using  the  triangular  inequality,  (29),  (22)  and  (34),  we  can  have 

llU  “  Ut,lli=o(0,r;t2(n))  +  llU  -  Uxlli2(0,T;H>'/2(r2))  £  (ll^lli»=(0,T;t2(t2))  +  II  II L2  (0,T;Hr/2  (fi))) 

+  (ll0lll~(O,r;i.2(fi))  +  H0lll2(O, T;H//2(fi)))  <  CX  k/2 . 

This  is  (28).  □ 

4.  Discretization 


Since  the  penalized  fPDE  cannot  be  solved  analytically,  it  needs  to  be  discretized  in  order  to  solve  it  numerically.  Various 
discretization  methods  are  available  in  the  open  literature.  In  this  section,  we  apply  the  discretization  technique  developed 
recently  in  [7]  to  the  fractional  derivatives  in  (19a).  We  also  use  Crank-Nicolson  time  stepping  method  to  construct  the 
discretization  scheme  for  the  penalized  equation  of  (2). 

Let  the  intervals  (xmin ,  xmax )  and  (ymin ,  ymax )  be  divided  into  Mx  and  My  sub-intervals  respectively  with  mesh  nodes 

Xi  =  Xmin  +  ihx,  i  =  o,  1 . Mx ;  y}=  ymin  +  jhy ,  j  =  0,1 . My, 

where  hx  =  (xmax  —  xmin)/Mx  and  hy  =  (ymax  -ymin)/My.  The  or-th  partial  derivative  defined  in  (9)  can  be  approximated  as 
follows  [7]: 


,  -a  i+1 

r(2  -  O')  Y.£kUi-MJ 

v  ’  k= 0 


(35) 
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for  any  i  e  {1, 2, . . . ,  Mx  —  1}  and  j  e  {1,2, _ My  -  1},  where  j  =  U(xi_j<+i,yj,  t).  The  coefficients  g^’s  are  given  by 

1  2?~a  —  4  ^  33"“  -  4  x  23"“  +  6 

“  (2  -  a)(3  -  a)  ’  ^  ~  (2-ot)(3-a)'  &  ~  (2-a)(3-a)  ’ 

fk  =  (2-g)1(3-a)[(k+1)3~g  “  4,<3"“  +  6(,<  -  1  )3'“  -  4('<  -  2)3'“  +  ( k  -  3)3"“]' 


for  k  =  3,  4 . i  +  1.  This  finite  difference  scheme  has  second  order  accuracy  as  proved  in  [7]. 

For  a  positive  integer  N,  let  (0,  T )  be  divided  into  N  sub-intervals  with  the  mesh  points  tn  =  T  —  nAt,  n  =  0. 1 . N, 

where  At  =  T/N.  Thus  T  =  t0  >  U  >  •  •  •  >  tN  =  0.  Using  (35)  we  define  the  following  finite  difference  operators  for  the  frac¬ 
tional  derivatives  in  (la): 


i+l 


SaU =  — 
x  ‘-J  h£T(2  - 


3^7  EOT, 


k+  l,j> 


3%".  =  _ 1 _ 

y  U  h(r(2-/3) 


k=0 


j-k+ 1 ’ 


(36a) 


where  Ujjq  denotes  an  approximation  to  U(xp,  y^,  t„)  for  all  feasible  (p,  q,  n).  We  also  define  the  following  central  difference 
approximations  to  Ux  and  Uy  respectively: 


=  Wx  iU^j  -  W  =  2F  (UM+1  -  UU-1  >■ 


(36b) 


Using  Crank-Nicolson  time  stepping  method  and  the  finite  differences  defined  in  (36a)  and  (36b),  we  construct  the 
following  discretization  scheme  for  (19a): 

,JAt  IJ  +  ^  (MxU^1  -  -  M^1  +  tU"jhl  +  dF+1j 

+  1  (aiSjq,  -  b,SxU"j  +  aAUE  -  b^UA  +  rt/ft  +  dfo)  =  0  (37a) 

for  l  =  1 , 2 . Mx  —  1 ,  j  =  1,  2 . My  —  1  and  n  =  1,2 . N  satisfying 

%=gt(yj,t„),  Uir0  =  g2(xi,tn),  U°j  =  U*(Xi,yj), 


(37b) 


where  dp.  :=  dfUih)  =  7_[UT  -  is  the  penalty  term. 

Eq.  (37a)  can  be  rewritten  as  the  following  linear  system: 

(i  +  ^M)vn+1  +  ^D(Vn+1)  =  (i  -  ^m)v"  -  1d(V" )  +  F" 


with 


vn  =  K,.^ . 

D(Vn)  =  (dCU^ldCU^) . d(U^_u ),...,  d(U"x_hMy_,))\ 


for  n  =  0,1 . JV-1,  where  I  is  an  (Mx  -  l)(My  -  l)-dimensional  identity,  F"  is  an  (Mx-1)  x  (My-1)  column  vector 

representing  the  average  of  the  contributions  of  the  boundary  conditions  at  time  levels  n  and  n  +  1,  and  M  is  a  block 
matrix  containing  (My  -  1)  x  (My  -  1)  blocks  .  The  size  of  each  block  is  (Mx  -  1)  x  (Mx  -  1). 


A  +  Bi  Bo  0 


0 


B2  A  +  Bi  Bq 


M  = 


B3 


B2  A  +  Bi  Bq 


0 


Bm,,-  2 


B2  A  +  B]  Bo 

B3  B2  A  +  Bj 
B3  B2 


0 

Bo 

A+B, 


(My- l)x(M,-l) 
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Table  1 

System  and  market  parameters  for  the  two- 
asset  American  option. 


a  ,  (3 

1.5 

r 

0.05 

o 

0.25 

K 

30 

Ql.  a2 

0.384 

K  b2 

0.884 

^min »  ymin 

In  0.1 

^max  >  ymax 

In  100 

Table  2 

Convergence  behavior  in  X. 


A  = 

:  10  X  2" 

n  =  1 

n  =  2 

n  =  3 

n  =  4 

k  = 

1 

Error 

1.3229 

0.6857 

0.3493 

0.1763 

log2  Ratio 

0.9480 

0.9731 

0.9863 

k  = 

2 

Error 

1.6211 

0.4752 

0.1246 

0.0315 

log2  Ratio 

1.7704 

1.9307 

1.9825 

k  = 

3 

Error 

1.8691 

0.3343 

0.0447 

0.006 

log2  Ratio 

2.4833 

2.9036 

3.0010 

In  the  above  expression,  A  is  the  discretization  matrix  on  x  direction  in  (35) 
M  +  ’-'  J  =  i+1  <(«rfS  +  *)  I„ 


%  = 


(ixgf  +  ^At,  j  =  i 

Hx&-rix,  j  =  i- 1 

Mxgf-  j  =  i~  l  +  U  =  3... 

0,  otherwise. 


B, 


(Mysf  +  jAt)Iy, 

(/Xyg^  -  T]x)Iy, 
(llygP)Iy, 


j  =  0 
J  =  1 
j  =  2 

j  =  3 . My, 


where  fix  =  -hi 


Ar 


r(2-a)h"  ’ 


,  rjx  =  a^,  Hy  =  -b2 


At 


r(2-/5)hf 


and  r)y  =  a2^-.  Note  that  the  boundary  conditions  g^.  and 


(37b)  have  to  be  defined  by  the  numerical  solutions  of  two  ID  systems. 

The  nonlinear  system  (37)  can  be  solved  by  the  following  damped  Newton’s  iterative  method: 

(i+ jM+1;d(w'-1))w-1  =  (i-  Im)v"-  Idoo  +  F"-  (i+1m)w'-1  -  ^(w'-1), 

c8w‘ -1 


for  1  =  1,2,...  until  a  convergence  criterion  is  satisfied  with  the  initial  guess  w°  =  Vn.  JD(w)  denotes  the  Jacobian  ma¬ 
trix  of  the  column  vectors  D(w)  and  k  e  (0,  1]  denotes  a  damping  parameter.  Then  we  choose  Vn+1  =  lim^^  wl  for  all 
n  =  0, 1, 2, . . . ,  N—  1. 


5.  Numerical  experiments 

In  this  section,  we  present  some  numerical  experimental  results  to  verify  the  theoretical  rate  of  convergence  obtained  in 
Section  3  and  the  rate  of  convergence  of  the  discretization  scheme  in  Section  4  to  demonstrate  the  accuracy  and  usefulness 
of  our  numerical  method.  To  achieve  this,  we  use  two  examples  and  the  first  test  example  is  chosen  to  be  the  following 
American  basket  option  pricing  problem. 

Example  5.1  (American  basket  put  option  pricing).  The  fractional  differential  LCP  (2)  with  system  and  market  parameters 
given  in  Table  1  and  the  weights  Wj  =  w2  =  0.5. 

To  investigate  the  convergence  rates  of  the  method  in  both  X  and  k,  we  choose  a  fixed  uniform  mesh  for  the  solution 
domain  (In (0.05),  ln(100))2  x  (0,  1)  in  (x,  y,  t )  with  Mx  =  My  =  50  and  N  =  50.  Since  the  exact  solution  to  this  problem  is 
unknown,  we  use  the  numerical  solution  with  X  =  106  and  k  =  1  as  the  reference  solution  denoted  as  VR.  We  solve  (19a)  on 
the  aforementioned  uniform  mesh  for  a  sequence  of  values  of  X  and  a  fixed  value  of  k,  and  compute  approximations  of  the 
following  continuous  norm  on  the  mesh  using  the  reference  and  numerical  solutions  VR  and  Vx: 

II^R  “  Vt  lli»o(0,r;L2(n))  +  II  Vr  -  Vx.  llL2(0,r;HX/2(S2j)- 

We  also  calculate  the  base-2  logarithm  of  the  ratio  of  the  errors  from  two  consecutive  values  of  X  for  a  fixed  k  and  the 
results  are  listed  in  Table  2.  From  Theorem  3.4  we  see  that  the  ratio  of  the  errors  in  V x  and  V^2  for  a  given  k  behaves 
like  2 kl2.  However,  from  Table  2  we  see  that  the  computed  ratios  behave  like  2k,  indicating  that  the  rate  of  convergence 
is  of  order  X~k.  In  fact,  it  has  been  proved  in  [15,26,28],  using  the  fact  that  all  the  norms  in  finite  dimensions  are  equiv¬ 
alent,  that  the  power  penalty  method  for  a  nonlinear  complementarity  problems  in  finite  dimensions  satisfying  a  strong 
monotone  condition  has  the  convergence  rate  0(X~k).  However,  the  convergence  rate  in  finite  dimensions  is  not  uniform  in 
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Table  3 

Convergence  behavior  in  k. 


k=  1 

CN 

II 

Si 

k  =  3 

o 

II 

Error 

0.6857 

0.4752 

0.3343 

log2  Ratio 

1.4430 

1.4216 

A.  =  80 

Error 

0.3493 

0.1246 

0.0447 

log2  Ratio 

2.8025 

2.7904 

II 

CD 

O 

Error 

0.1763 

0.0315 

0.0060 

log2  Ratio 

5.5900 

5.6529 

100 


100 


Fig.  1.  Computed  prices  of  an  American  Basket  option  when  a  =  1.5. 


the  dimensionality.  Since  norms  on  an  infinite-dimensional  space  are  usually  not  equivalent,  we  are  unable  to  achieve  the 
0(X~k)- rate  of  convergence  as  in  finite  dimensions. 

We  now  investigate  numerically  the  rate  of  convergence  of  the  method  in  k  for  a  fixed  X  >  1.  From  (28)  we  see  the  ratio 
of  the  errors  in  the  solutions  using  k  and  k  +  1  equals  0(X^k+^/2/Xk/2)  =  C>(7.1/2),  i.e.,  the  ratio  is  a  constant  for  any  k.  The 
computed  results  for  different  values  of  k  and  X  are  listed  in  Table  3,  from  which  we  see  that  the  ratios  of  the  errors  for 
any  two  consecutive  values  of  k  are  almost  constants. 

The  solution  when  a  =  /I  =  1.5  is  illustrated  in  Fig.  1.  We  have  also  repeated  the  above  numerical  experiments  for 
a  =  fi  =  1.3. 1.7  and  found  that  the  computed  convergence  rates  are  the  same  as  the  corresponding  ones  for  a  =  fi  =  1.5, 
which  suggests  that  the  convergence  rates  of  the  penalty  method  do  not  depend  on  the  fractional  order  a  or  /3. 

To  see  the  influence  of  a  and  /3  on  the  option  price,  we  solve  the  test  problem  for  a  =  ^  =  1.3, 1.5, 1.7.  and  plot  the 

cross-sections  at  Sx  =  Sy,  0  <  SX  <  100,  and  t  =  0  in  Fig.  2  of  the  differences  between  the  numerical  solutions,  VFBS,  of  the 

test  problem  and  the  numerical  solutions  of  the  standard  American  option  VBS  (i.e.,  a  =  1 6  =  2).  From  Fig.  2,  we  see  that  the 
American  put  option  from  the  fractional  model  is  more  valuable  than  that  from  the  standard  model.  Also,  the  value  of  the 
option  increases  as  a  and  ji  decreases.  This  phenomenon  is  reasonable  as  when  a  and  /f  are  smaller,  the  price  movement 
is  faster  and  thus  the  option  premium  is  higher,  similar  to  the  case  that  the  larger  the  volatility,  the  higher  the  option 
premium. 

Example  5.2  (Fractional  advection-diffusion  equation).  To  test  the  rate  of  convergence  of  the  discretization  scheme  we 
choose  the  following  linear  fPDE  to  which  the  exact  solution  is  known: 

ut  +  ux-  0D"  u  -  oDyU  =  f(x,y,t) 
with  boundary  and  terminal  conditions 

u(x,  0,  t)  =  u(x,  1,  t)  =  0,  ye  (0,1),  t  e  (0, 1  ], 

u(0,y,  t)  =  u(l,y,  t)  =  0,  xe  (0,1),  f  e  (0,1], 

u(x,y,  1)  =  x3y4,  (x,  y)  e  (0. 1)  x  (0, 1], 

where  /(x,y,  t)  =  x3y4  +  (3x2y4  -  r^Jx3-“y4  +  4x3y3  -  ~  *3y4)t. 
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Table  4 

Computed  rates  of  convergence  for  Example  5.2. 


h: 

~  ^  —  5x2»" 

cGL 

log2€t 

Cm 

l0S2I& 

m 

=  0 

1.9403e— 02 

1.7505e-03 

m 

=  1 

1.4090e— 02 

0.4616 

5.4035e-04 

1.6958 

m 

=  2 

8.2916e-03 

0.7650 

1.4873e— 04 

1.8612 

m 

=  3 

4.4631  e-03 

0.8936 

3.8689e-05 

1.9427 

m 

=  4 

2.3158e-03 

0.9466 

9.8882e-06 

1.9681 

m 

=  5 

1.1801  e-03 

0.9726 

2.5081e-06 

1.9791 

We  choose  a  =  ft  =  1.5  and  the  exact  solution  to  the  above  problem  is  u(x,t)  =x3y4t.  This  problem  is  solved  using 
a  sequence  of  uniform  meshes  with  mesh  sizes  hx  =  hy  =  h  =  At  =  ^  x  2~m  for  m  =  0, 1, . . . ,  5.  For  each  m,  the  following 
discrete  maximum  norm  is  computed: 


Em 


max  max  max 

0<n<N-l,  l<i<Mx-l,  \<j<My-\ 


u(Xi,yj ,  tn)  ~  UJ} 


where  (LFp  denotes  the  numerical  solution  by  the  discretization  scheme.  These  computed  errors,  along  with  computed  rates 

of  convergence  log2(Em+1/Em),  for  k  =  0, 1 . 4,  are  listed  in  Table  4  from  which  we  see  that  the  rates  of  convergence  of 

our  method  are  of  order  0(At2  +  h2  +  h2),  while  a  lengthy  mathematical  proof  of  this  upper  error  bound  can  be  found 
in  [9],  For  comparison,  we  have  also  solved  this  2D  problem  using  the  combination  of  the  Crank-Nicolson  time-stepping 
scheme  and  Griinwald-Letnikov  method  in  [23]  which  is  a  popular  method  for  fPDEs.  The  computed  errors  Ef^’s  and  the 
rates  of  convergence  for  the  GL  method  are  also  listed  in  Table  4,  from  which  it  is  clear  that  GL  method  is  only  lst-order 
accurate,  and  our  method  has  a  2nd-order  convergence  rate. 


Finally,  we  comment  that  the  coefficient  matrix  M  of  the  discretized  system  in  Section  4  is  dense  and  thus  the  computa¬ 
tional  costs  for  solving  the  discretized  system  is  usually  high,  particularly  in  2  spatial  dimensions.  Theoretically,  it  is  known 
that  the  computational  cost  for  solving  the  system  using  the  LU  decomposition  is  of  the  order  0((MX  x  My)3).  The  develop¬ 
ment  of  efficient  algorithms  for  (37)  such  as  conjugate  gradient  based  and  ADI  algorithms  is  a  future  topic  and  challenge  for 
us,  though  it  is  beyond  our  current  discussion.  Also,  a  comparison  of  numerical  performances  of  a  penalty  method  similar 
to  the  current  one  with  the  augmented  Lagrangian  method  for  solving  conventional  American  option  pricing  problems  has 
been  given  in  [32],  Thus  we  refer  readers  to  this  comparison  study. 


6.  Conclusion 


In  this  paper,  we  proposed  and  analyzed  a  power  penalty  method  a  2-dimensional  fractional  differential  linear  comple¬ 
mentarity  problem  for  pricing  American  options  on  two  independent  assets.  We  proved  that  the  solution  from  the  penalty 
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method  converges  to  that  of  the  linear  complementarity  problem  at  the  rate  of  0(k~k/2).  A  2nd-order  accurate  discretization 
scheme  has  also  been  developed  for  solving  the  nonlinear  fractional  partial  differential  equation  arising  from  the  penalty 
approach.  Numerical  experiments  were  performed  to  verify  the  theoretical  rates  of  convergence  and  demonstrate  that  nu¬ 
merical  method  produces  financially  meaningful  results. 
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In  this  paper  we  propose  a  combination  of  a  penalty  method  and  a  finite  volume  scheme  for 
a  four-dimensional  time-dependent  Hamilton-Jacobi-Bellman  (HJB)  equation  arising  from 
pricing  European  options  with  proportional  transaction  costs  and  stochastic  volatility.  The 
HJB  equation  is  first  approximated  by  a  nonlinear  partial  differential  equation  containing 
penalty  terms.  A  finite  volume  method  along  with  an  upwind  technique  is  then  developed 
for  the  spatial  discretization  of  the  nonlinear  penalty  equation.  We  show  that  the  coefficient 
matrix  of  the  discretized  system  is  an  M-matrix.  An  iterative  method  is  proposed  for  solving 
the  nonlinear  algebraic  system  and  a  convergence  theory  is  established  for  the  iterative 
method.  Numerical  experiments  are  performed  using  a  non-trivial  model  pricing  problem 
and  the  numerical  results  demonstrate  the  usefulness  of  the  proposed  method. 

©  2017  Elsevier  Ltd.  All  rights  reserved. 


1.  Introduction 

Valuation  of  options  is  one  of  the  most  important  problems  in  financial  engineering.  For  over  four  decades,  practitioners 
and  academic  researchers  in  finance,  economics  and  mathematics  have  engaged  in  the  study  of  option  pricing.  Various 
option  pricing  approaches  have  been  developed  (see,  for  example,  [1-9]).  One  of  the  methods  is  the  utility  based  option 
pricing  approach  which  has  been  widely  used  for  valuing  European  and  American  options  when  the  trading  of  the  underlying 
stocks  incurs  proportional  transaction  costs  [3,6,10-18],  Recently,  Caflisch  et  al.  [19]  and  Cosso  [20]  applied  this  approach  to 
pricing  European  options  and  American  options  respectively  under  proportional  transaction  costs  and  stochastic  volatility. 
More  specifically,  in  [19]  the  authors  assumed  that  the  underlying  stock  price  follows  a  geometric  Brownian  motion  and 
the  associated  volatility  evolves  according  to  a  stochastic  process  of  the  Ornstein-Uhlenbeck  type.  By  following  the  utility 
maximization  procedure  proposed  in  [3],  they  derived  a  set  of  non-linear  HJB  equations  governing  European  option  prices. 
They  also  obtained  an  asymptotic  expression  for  the  European  option  price  in  the  limit  of  small  transaction  costs  and  fast 
mean  reversion  volatility  under  the  assumption  of  an  exponential  utility  function.  In  [20],  the  authors  considered  American 
option  pricing  with  proportional  transaction  costs  and  stochastic  volatility.  They  assumed  that  the  stochastic  volatility 
follows  the  Cox-lngersoll-Ross  (CIR)  process.  They  also  showed  that  computing  the  price  of  an  American  option  involves 
solving  a  singular  stochastic  optimal  control  problem  and  proved  the  existence  and  uniqueness  of  the  viscosity  solution 
to  the  associated  HJB  equation.  Moreover,  they  solved  the  HJB  equations  using  the  Markov  chain  approximation  when  the 
utility  function  is  exponential. 
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In  this  paper,  we  will  develop  a  new,  efficient  and  accurate  numerical  method  for  computing  European  option  prices 
based  on  the  pricing  model  in  [19,20],  In  both  [19,20],  the  utility  function  is  assumed  to  be  an  exponential  function.  It  is 
well  known  that  using  the  exponential  utility  function  can  reduce  the  number  of  state  variables  in  the  HJB  equation  by  one 
under  a  proper  transformation.  Thus,  the  use  of  this  special  function  can  simplify  the  problem  considerably.  Although  the 
transformation  substantially  reduces  the  computational  cost,  it  may  not  be  applicable  to  other  types  of  utility  functions.  The 
aim  of  this  paper  is  to  develop  a  numerical  method  which  can  efficiently  and  accurately  solve  the  HJB  equation  without  any 
dimension  reduction  technique.  Therefore,  the  numerical  method  developed  in  this  work  can  be  used  for  computing  option 
prices  with  any  types  of  utility  function. 

This  paper  is  organized  as  follows.  In  Section  2,  we  give  a  brief  account  of  the  formulation  of  the  European  option  valuation 
problem  as  a  set  of  HJB  equations  using  the  utility  maximization  theory.  In  Section  3,  we  first  use  a  known  penalty  approach 
to  approximate  the  HJB  equations  by  a  nonlinear  PDE  with  penalty  terms  to  penalize  the  parts  which  violate  the  constraints. 
We  then  propose  a  finite  volume  scheme  for  the  penalty  equation.  In  Section  4,  an  iterative  algorithm  and  its  convergence 
will  be  provided  and  in  Section  5,  we  present  the  numerical  results  to  demonstrate  the  usefulness  of  the  numerical  method. 

2.  The  European  option  pricing  model 

In  this  section,  we  will  present  a  brief  account  of  the  European  option  pricing  model  when  the  volatility  is  stochastic  and 
trading  the  underling  stocks  is  subject  to  proportional  transaction  costs.  A  detailed  mathematical  deduction  of  the  model 
can  be  found  in  [16,20], 

2.1.  Stochastic  volatility  model  with  transaction  costs 

Consider  a  market  consisting  of  a  risky  stock  and  a  risk-less  bond.  Assume  that  the  price  of  the  stock  at  time  u  e  [0,  T], 
denoted  as  Su,  evolves  according  to  the  following  stochastic  volatility  model: 

^  =  p.du  +  VrATOdWj,  (1) 

where  p  is  constant  drift  rate  and  s/v(u)  is  the  volatility  function  which  satisfies  the  following  Cox-Ingersoll-Ross  (CIR) 
process: 

dv(u)  =  %(t]  -  v(u))du  +  ft^/v(u)dW2,  (2) 

where  £  is  the  speed  of  adjustment,  p  is  the  mean  and  ft  is  the  volatility  to  volatility.  In  (2)  r)  and  ft  are  assumed  to  be 
constant  satisfying  2£ rj  >  ft2,  and  Wu'  and  W2  are  Wiener  processes  on  a  filtered  probability  space  (T2,  F ,  (Fu)o<u<t,  P) 
with  correlation  p. 

We  also  assume  that  the  price  of  the  bond,  B(u),  evolves  according  to  the  following  ordinary  differential  equation 
dB(u)  =  rB(u)du, 

where  r  >  0  is  a  constant  interest  rate. 

Suppose  that  the  investors  must  pay  transaction  costs  when  buying  or  selling  the  stock  and  the  transaction  costs  are 
proportional  to  the  amount  transferred  from  the  stock  to  the  bond.  Let  j3u  denote  the  amount  the  investors  hold  in  the  bond 
and  au  the  number  of  shares  of  the  stock  held  by  the  investors  at  time  u  e  [0,  T],  then  the  evolution  equations  for  jBu  and 
au  are 

dpu  =  rfiudu  -  (1  +  0)SudLu  +  (1  -  6)SudMu ,  (3) 

dau  =  dLu  —  dMu ,  (4) 

where  6  e  [0,  1)  represents  the  proportional  transaction  cost  rate  when  buying  and  selling  the  stock,  and  Lu  and  Mu  denote 
respectively  the  cumulative  number  of  shares  purchased  and  sold  up  to  time  u.  Let  c(au,  Su)  denote  the  liquidated  cash  value 
of  the  stock  and  Wu  the  investor's  wealth  at  time  u.  We  have 

c{otu ,  Su )  =  Su  (ctu  G  |  otu  | ) 

Wu(otu,  pu,  Su)  =  pu  +  Su(otu  -  9\au\). 


2.2.  European  option  pricing  via  utility  maximization 

We  now  describe  the  utility  based  option  pricing  approach.  The  idea  of  the  utility  based  option  pricing  approach  is  to 
consider  an  optimal  portfolio  selection  problem  of  an  investor  whose  objective  is  to  find  an  admissible  trading  strategy  to 
maximize  his/her  expect  utility  of  terminal  wealth.  Under  this  approach,  the  reservation  purchase  (respectively  write)  price 
of  an  option  is  the  price  at  which  the  investor  has  the  same  maximum  expected  utility  whether  he/she  buys  (respectively 
writes)  the  option  or  not.  To  use  this  approach  to  value  reservation  purchase  and  write  prices  of  European  call  options,  we 
first  need  to  define  the  following  three  different  utility  maximization  problems. 


DISTRIBUTION  A.  Approved  for  public  release:  distribution  unlimited. 


2456 


W.  Li,  S.  Wang/ Computers  and  Mathematics  with  Applications  73  (2017)  2454-2469 


Problem  2.1  (Utility  Maximization  for  an  Investor  Without  an  Option).  Consider  an  investor  who  trades  only  in  the  underlying 
stock  and  the  bond.  At  time  t  e  [0,  T],  the  investor  holds  p  dollars  in  the  bond  and  a  shares  of  the  stock  of  price  S  with 
volatility  v.  The  objective  of  the  investor  is  to  maximize  the  expected  utility  of  terminal  wealth  over  all  admissible  strategies, 
i.e., 


V°(t,a,  p,S,v)  =  sup  Et[U(WT)]  (0  <  t  <  T),  (5) 

A°(t,a,p,S,v) 

where  V°(t,  a,  p,  S,  v)  denotes  the  investor’s  time  t  maximum  expected  utility  of  terminal  wealth  (also  known  as  value 
function),  Et  denotes  the  expectation  operator  conditional  on  the  time  t  information  (a,  p,  S,  v)  and  U(.)  is  a  utility  function. 
A°(t,  a,  p,  S ,  v)  is  the  set  of  admissible  strategies  available  to  the  investor  which  is  defined  as  the  set  of  right-continuous, 
measurable,  F-adapted,  increasing  processes,  Lu  and  Mu  (t  <  u  <  T),  such  that  the  following  conditions  are  satisfied: 

1.  The  associated  processes  (aLu-Mu,  pL,,Mu,Su,  vu)  satisfy  ( 1 )— (4)  in  [t,  T]  with  the  initial  state  ( t ,  a ,  p,  S,  v). 

2.  p1"’™"  +  SuaLuMu  -  Su9\aLu'Mu\  >  0 ,Vue  [t,  T], 

The  choice  of  the  utility  function  U  is  non-unique  and  a  popular  one  is  the  following  exponential  function: 

U(W )  =  1  -  exp(-yW),  (6) 

where  y  >  0  is  a  constant  risk  aversion  parameter. 

Problem  2.2  (Utility  Maximization  for  an  Investor  Buying  an  Option ).  Assume  that  the  investor  trades  in  the  market  for  the 
underlying  stock  and  the  bond,  and  in  addition,  purchases  a  cash-settled  European  call  option  written  on  the  stock  with 
strike  price  I<  and  expiry  date  T.  Then  the  investor’s  objective  is  to  choose  an  admissible  trading  strategy  to  maximize  the 
expected  utility  of  terminal  wealth,  i.e., 

Vb(t,a,  p,S,v)  =  sup  Et[U(Wj  +  (Sj  —  K)+)j  (0  <t<T),  (7) 

AbU,a,p,S,v) 

where  Ab(t,  a,  p ,  S,  v )  =  A°(t,  a,  p,  S,  v)  and  x+  =  max{x,  0}. 

Problem  2.3  (Utility  Maximization  for  an  Investor  Writing  an  Option ).  Assume  that  the  investor  trades  in  the  market  for  the 
underlying  stock  and  the  bond,  and,  in  addition,  sells  a  cash-settled  European  call  option  written  on  the  stock  with  strike 
price  K  and  expiry  date  T.  Then,  the  investor’s  objective  is  to  maximize  the  expected  utility  of  terminal  wealth  over  the  set 
of  feasible  strategies,  i.e., 

Vw(t,  a,  P,  S,  v)  =  sup  Et[U(WT  -  (ST  -I<)+)]  (0  <  t  <  I),  (8) 

Aw(t,a,f},S,v ) 

where  Aw(t,a,  p,S,v)  denotes  the  writer’s  admissible  strategies  which  are  defined  as  the  set  of  right-continuous, 
measurable,  F-adapted,  increasing  processes,  Lu  and  Mu  (t  <  u  <  T),  such  that  the  following  conditions  are  satisfied. 

1.  The  associated  processes  (aLll’Mu,  pluMu,Su,  vu )  satisfy  ( 1 )— (4)  in  [t,  T]  with  the  initial  state  (t,  a,  p,  S,  v). 

2.  +SU  (aLu-Mu  -  1/(1  -6 ))  -Su9  1/(1  -  0)|  >  0  .Vue  [ t,T ]. 

We  comment  that  Item  2  in  each  of  Problems  2. 1-2.3  represents  the  no-bankruptcy  restriction.  These  conditions  ensure 
that  the  investor’s  wealth  is  positive  at  all  trading  times. 

Using  the  above  problems,  we  now  define  the  reservation  purchase  and  write  prices  of  a  European  call  options  as  follows. 

Definition  2.4  (Reservation  Purchase  Price  of  a  European  Call  Option ).  Consider  an  investor  who  starts  trading  at  time  f  =  0 
with  holding  p  dollars  in  the  bond  and  a  shares  of  the  stock  of  price  S  with  volatility  v.  Assume  that  the  investor  only  can 
buy  the  option  at  the  initial  time  t  =  0.  Then  the  investor’s  reservation  purchase  price  of  a  European  call  option  is  defined 
as  the  amount,  Pj,,  such  that  Vb(0,  a,  p  —  P/,,  S,  v)  =  V°(0,  a,  p ,  S,  v). 


Definition  2.5  (Reservation  Write  Price  of  a  European  Call  Option).  Consider  an  investor  who  starts  trading  at  time  t  =  0  with 
holding  p  dollars  in  the  bond  and  a  shares  of  the  stock  whose  price  is  S  with  initial  volatility  v.  Assume  that  the  investor  can 
only  sell  the  option  at  the  initial  time  t  =  0.  Then  the  investor’s  reservation  write  price  of  a  European  call  option  is  defined 
as  the  amount,  Pw,  such  that  Vw(0,  a,  p  +  Pw,  S,  v)  =  V°(0,  a,  p,  S,  v). 

From  the  above  definitions  it  is  clear  that  computing  reservation  purchase  or  write  price  of  a  European  option  involves 
two  of  the  three  value  functions  defined  in  (5)— (8).  By  the  dynamic  programming  principle,  we  can  derive  an  HJB  equation 
with  a  set  of  appropriate  terminal  conditions  governing  the  value  functions  V°,  Vb  and  Vw. 

Let  Xk,  k  =  1,2,3,  be  the  linear  differential  operators  defined  respectively  by 


,3  3  3  3  1  ,  32  1  ,  32  32 

A  =  -  ^7  +  rP^  +flS^  +^9-  v)—  +  -S  v—  +  -&  V—  +  P&SV—— 

,3 1  3  p  3  S  dv  2  3  Sz  2  dvz  3S3r> 


(9) 
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+(1+0)v 


(10) 


d 


3 


(11) 


Then,  the  value  functions  V°,Vb,  Vw  are  defined  by  the  following  HJB  equation 
miniAV,  £2V,  £3V}  =  0, 


(12) 


for  (t,  a,  p,  S,  v)  €  [0,  T)  x  £2'  x  (0,  +oo)  satisfying,  respectively,  the  following  terminal  conditions: 
V(T,  a,  p,  S,  v)  =  V‘(T,  a,  p,  S,  v ),  (a,  j8,  S,  v)  e  Q‘  x  (0,  +oo) 


(13) 


for  i  =  0,  b  and  w  respectively,  where 

V°(T,  a,  p,  S,  v)  =  U(P  +  S(a  -  0|a|)), 

Vb(T,a,p,S ,  v)  =  U{P+S(a  -  6\a\)  +  (S  -  K)+), 
VW(T,  a,  p,  S,  v)  =  U(P  +  S(a  -  0|ar|)  -  (S  -  K)+), 


(14) 

(15) 

(16) 


and 

£2°  =  £2b  =  {(a,  p,  S)  £  R  x  R  x  R+  :  p  +  Sa  -  S0|a|  >  0}, 

Qw  =  {(a,  p,  S)  €  M  x  R  x  R+  :  p  +  S  (a  -  1/(1  -  0))  —  SO  \a  -  1/(1  -  0) |  >  0}  . 


(17) 

(18) 


In  the  above  (u)+  =  maxfu,  0}.  Note  that  (12)  is  nonlinear  and  it  does  not  have  in  general  classical  solutions.  It  has  been 
proved  in  Cosso  et  al.  [20]  that  the  value  functions  defined  by  (5)-(8)  are  unique  viscosity  solutions  of  their  respective  HJB 
equations  ( 12)— ( 16).  This  is  given  in  the  following  theorem. 

Theorem  2.6.  Let  i  e  {0,  b,  w}  and  assume  that  the  value  function  V1  is  continuous  on  [0,  T]  x  L2l  x  (0,  +oo),  then  the  value 
function  V1  is  the  unique  constrained  viscosity  solution  of  (12)  with  the  terminal  condition 

V(T,  a,  p,  S,  v)  =  V\T,  a,  p,  S ,  v),  for  (a,  p,  S,  v)  e  Ql  x  (0,  +oo), 

where  V'(T,  a,  p,  S,  v)  and  Q'  are  defined  in  ( 14)— ( 16)  and  ( 17)— ( 18)  respectively  for  i  =  0,  b,  ui. 

3.  The  numerical  techniques 

Note  that  ( 1 2 )  can  be  regarded  as  a  constrained  optimization  problem  in  infinite  dimensions.  Thus,  the  numerical  solution 
of  (12)  involves  numerical  optimization  techniques  and  discretization  schemes.  In  this  section,  we  will  propose  a  nonlinear 
PDE  containing  penalty  terms,  called  penalty  equation,  to  approximate  (12).  The  penalty  terms  in  the  penalty  equation 
penalize  the  part  of  its  solution  which  violate  Jiy  >  0  for  i  =  2,  3,  while  £\V  >  0  is  automatically  satisfied  by  the 
formulation.  We  will  then  develop  a  finite  volume  method  along  with  the  full  implicit  2-step  time  stepping  method  for  the 
resulting  penalty  equation  and  show  that  the  system  matrix  is  an  M-matrix. 

3.  t.  The  penalty  approach 

Penalty  methods  have  been  developed  for  solving  both  finite-  and  infinite-dimensional  HJB  equations  [21-25],  In 
particular,  we  propose  a  penalty  method  in  [16-18]  for  the  European  and  American  option  pricing  problems  under 
proportional  transaction  costs  with  a  constant  volatility.  Motivated  by  our  previous  work,  we  propose  the  following  penalty 
formulation  for  ( 12): 


(19) 


£\Vx  +  ^[<£2^7]  +^[=£3^4]  —  0 


for  (t,  a,  p,  S,  v)  e  [0,  T)  x  Q'  x  (0,  +00)  with  the  terminal  condition 

14 (7\  a ,  p,  S,  v)  =  V’(T,  a,  P,  S,  v),  for  (a,  p,  S,  v)  £  &  x  (0,  +00), 


(20) 


where  are  the  differential  operators  defined  in  (9)— ( 11),  V'(T,  a,  p,  S ,  v)  is  the  boundary  condition  given  in  (14)-(16) 
for  each  i,  X  >  1  is  a  penalty  parameter,  i  £  [0,  b ,  w }  and  (u)-  =  min{u,  0}  for  any  function  v. 

For  the  solution  of  (19)  we  have  the  following  convergence  result. 

Theorem  3.1.  For  any  i  £  {0,  b,  w],  let  V1  be  the  unique  constrained  viscosity  solution  of  ( 12)— ( 13).  For  each  X  >  1, 
( 19)— (20)  has  a  unique  viscosity  solution  v[  and  v[  — >  V1  as  X  — >  00. 

The  proof  of  this  theorem  is  essentially  a  repetition  of  the  proofs  ofTheorems  4.2  and  4.3  in  [16]  in  which  £\  is  a  special 
case  of  that  defined  in  (9).  However,  all  the  required  properties  used  in  the  proofs  in  [16]  are  satisfied  by  £\  in  (9).  Thus,  we 
omit  this  proof. 
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3.2.  The  finite  volume  method 


Finite  volume  methods  have  been  used  for  solving  one  and  two-dimensional  Black-Scholes  equations  [26-28],  In  this 
section,  we  will  propose  a  finite  volume  method  with  an  upwind  technique  for  (19).  This  method  has  the  merit  that  the 
coefficient  matrix  of  the  resulting  system  matrix  from  the  method  is  always  an  M-matrix  even  when  p  ^  0  in  (9).  In  fact, 
this  property  cannot  be  achieved  by  any  finite  difference  discretization  scheme.  For  brevity,  we  only  consider  the  case  that 
i  =  b  in  (19)  and  (20).  The  methods  for  the  other  two  cases  are  essentially  the  same  as  that  for  i  =  b  and  thus  are  omitted. 
Before  proceeding,  we  first  rewrite  the  second  and  last  terms  of  (19)  as  the  following  equivalent  form: 

=  min  mX 2Vk,  i*.[oC3V4]-  =  min  hX3Vk. 

me[0,A]  ne[0,A] 

Then,  (19)  can  be  rewritten  as  the  divergence  form  as: 


dV 

9 7 


V  •  (AW)  +  b  ■  W  +  min  mX2Vk  +  min  nX3Vk  =  0, 

me[0,A.]  ne[0,A] 


where 


(0 

0 

0 

0 

/° 

0 

0 

0^ 

0 

0 

0 

0 

0 

0 

0 

0 

1  , 

1 

0 

0 

a33 

a34 

0 

0 

-S2v 

2 

-ptfSv 

\0 

0 

a43 

a44) 

1 

1  , 

v° 

0 

-  pftSv 

-§2v 

2 

W 


(  0  \ 

-rfi 

1 

Sv  +  -pi?S  ~  M-S 
^P§V  +  \®2  ~  V)> 


Let  Q  =  (— oo,  +oo)2  x  (0,  +oo)2.  We  consider  the  problem  in  the  following  finite  region: 


(21) 


(22) 


(23) 


Ql  :=  (— L„,  La)  x  (— Lg,  Lfs)  x  (0,  Ls)  x  (0,  Lv)  C  12 


(24) 


where  L^,  Lp,La,  Lp,  Ls  and  Lv  are  positive  constants.  To  discretize  12L,  we  choose  four  positive  integers  E,  M,  P  and  Z  and 
use  these  integers  to  define  a  uniform  mesh  for  12t  with  mesh  nodes 


cq  =  —  Lff  +  i  x  hi,  i  =  0,  1,  2, . . . ,  E, 
Pj  =  —  Lp  +  j  x  h2,  j  =  0,  1,  2, . . . ,  M, 
Sk  =  l<xh3,  k  =  0,  1,  2,  . . .  ,P, 

V;  =  I  x  h4,  l  =  0,  1,  2, . . . ,  Z, 


where 


/ii 


iff  +  La 
E 


h4  = 


K 
z ' 


Let  h  =  max{hi,  h2,  h3,  h4}.  In  what  follows,  we  will  characterize  this  spatial  mesh  using  the  grid  index  set:  G/,  =  9G/,  U  G/, , 
where  G/,  and  9 G/,  denote  the  index  sets  of  the  interior  and  boundary  mesh  nodes  defined  respectively  by 


Gh  =  {( i,j,k.l )  :  i  =  1,2,  ...,£  -  \,j  =  1,2,  ...,M  -  1,  k  =  1.  2,  ...,P  -  1,  /=  1,2,  ...  ,Z  -  1} 
and 


9G/,  =  {(0,  j,  k,  I),  (E,j,  k.  /),  (i,  0,  k,  /),  (i,  M,  k,  l),  ( ij ,  0,  /),  (i,j,  P,  I), 

(i,j,  k,  0),  ( ij ,  k.  /)  :  i  =  1,  2  . .  .E,j  =  1,  2, . . . ,  M,  k  =  1,  2, . . . ,  P,  l : 


1,2,...,  Z}. 


Clearly,  each  grid  point  (i,  j,  k,  l)  e  G/,  corresponds  to  a  state  (a,,  fij,  Sk,  vp. 

Dual  to  the  above  mesh  (called  primary  mesh),  we  define  a  secondary  mesh  with  the  mesh  nodes 


“'-I  = 


a,_i  +  at 


= 


:-i  + 1 


Pj'-i 


V’  = 

K  2 


J/;-l 


+  sk 


Vl- 1  +  V, 


2 
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for  i  =  0,  ...,£  +  1,  j  =  0, _ M  +  1,  k  =  0, . . . ,  P  +  1  and  /  =  0, . . . ,  Z  +  1  with  the  convention 

“e+1  =  “£• 


a_t  =  a0, 
2 

S  i  =  So, 


.  i  =  f}0,  aM+  i  =  /JM, 


Sp+i  =  SP, 


v_i  =  v0,  v,  i  =  vz. 

2 

For  each  mesh  node  (ij,  k,  l)  e  G/,,  we  define  a  so-called  box  or  control  region  centered  at  the  point  by 


R 


ijkl  : 


j  -  “■•+ 1 )  x  (Pj- I’Pj+l)  x  (S'<- 1  -  V i )  x  (' v i  -  v i )  • 


Integrating  (21)  over  each  control  region  Rpj  and  applying  integration  by  parts  to  the  2nd  term,  we  have 


/ 

J  Rijkl 

+ L 


dv 

—dad/3dSdv  — 


/  < 

J  dRijkl 


m  ™e[(U1 


/ 

J  Rijkl 


/  1 

J  Rijkl 


(AW)  •  nda  +  I  b-  (VV )dad/3dSdv 


min  mX2VkdadfidSdv  +  /  min  nX3VkdadjidSdv  =  0 


nelO.X] 


(25) 


for  (i,j,  k,  l )  e  G/,,  where  3 Rp/  denotes  the  boundary  of  Rp;,  n  the  unit  vector  out-normal  to  3 Rp;  and  dcr  denotes  the  3d 
infinitesimal  along  3Rp(.  Using  the  1-point  quadrature  rule  and  (23),  we  have 


/. 

L 


dv 

—dadjidS  dv 


8Vj 


ijkl 


3 1 


I  Rijkl  I 


,  3V  3V  3V \ 

b-  (W)dadfidSdv  «  i,2  —  +  b3  —  +  ^4—  ) 

3/3  3S  3v  /, 


l%ul. 


(otj,0j,Sk,Vl) 

r  (  dv  dv  \ 

/  min  mX2Vj.dadBdSdv  »  min  ml - |-(1+0)S — )  |Rp;|, 

V  da  dP  J  (ai,f>j,sk,Vl) 

f  _  _/3V  3V  \ 

/  min  nX3VkdadfidSdv  min  n  I - (1  —  0)S —  )  |Rp/|, 

;%W1  V9«  3/6/, 


(26) 

(27) 

(28) 
(29) 


(“i  ,fij,Sk,vi) 


where  |  •  |  denotes  the  ‘measure’  (absolute  value,  area  or  volume  depending  on  the  context)  of  a  quantity  and  Vl]kt  denote  an 
approximation  ofV  at  the  mesh  node. 

We  now  consider  the  approximation  of  the  second  term  in  (25).  Since  Rp  is  a  hyper-rectangle  or  box  in  4D,  3Rp;  contains 
8  3D  rectangular  prisms  or  facets.  Each  of  these  facets  is  perpendicular  to  one  of  the  axes  so  that  its  normal  direction  n  is  in 

or  opposite  the  direction  of  the  axis.  In  fact,  the  possible  normal  directions  are  (±1,0,  0,  0)T,  (0,  ±1,  0,  0)T,  (0,  0,  ±1,  0)T 

and  (0,  0,  0,  ±1)T.  From  the  definition  A  in  (22)  and  these  choices  of  n  we  see  that  AW  •  n  has  only  4  non-zero  terms 
corresponding  to  the  facets  intersecting  S)(±  i  and  v|±i .  Therefore,  we  have 

r  r  (  dv  dv\  r  (  dv  dv\ 

-  AW  •  nda  =  —  I  a33  —  +  a34  —  dad/3dv  +  l  a33— +  a34— )  dadfidv 

Ja Ryu  Ja Rp,  V  ds  dv  J  Jsr fjkl  V  dS  dv  J 

f  (  dV  dV \  f  (  dV  dV\ 

~  /  I  q43~  +  £*44  7“  I  dadjidS  +  /  I  a43  5ttr  +  a44  —  )  da  dfidS, 

Ja RfJkl\  dS  dv  J  J 3R4W  V  ds  dv  J 


where  3R™W,  m  =  1,  2,  3,  4,  denote  the  4  facets  of  3Rp;  on  which  AW  •  n  /  0.  Applying  the  1-point  quadrature  rule  to  the 
above  equation  and  noting  that  these  facets  are  numbered  in  such  a  way  that  |Rjw|  =  | R?w |  and  | Rfjkl |  =  |Rp,|,  we  have 


L 


dv  dv\ 

AW  •  nda  ~  —  I  a33 - h  a34 —  I 


ijkl 


ds 


dv  J 


dV  dV\ 
+  ,Q33  3S  +fl34  3^j 


dv  dv\ 

-  \a43-+a44— j 


dV  dv\ 
±  I  +a44— j 


(“.'W-V  i-n) 


X  l^yk/l 


x  lRp/l 


X  \*U 


X  I^hI. 


(30) 
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where 


4(  =  a,+i) x  (Pi-i’Pj+i) x 

Kki  =  («i_t  x  £,-+,)  x  (sk_i,st+i)  . 


Replacing  the  terms  in  (21)  by  their  respective  approximations  in  (26)-(30),  we  have 


Wm  \  (  dV  ^  dV 

|Rp/|  —  (  Q33^T  +  a34 


3t 


dS 


dv 


(a^j’sk+ }-v‘) 


3V 


3V\ 


IVI+(a33^+a34^J 


3  S 

-  y~i-n’-k+yj 

,  3  V  3V\  ,  4  ,  /  3V  3V 

“  I  a437^  +  044 ^7 ) /  \  lRPil  +  ( a43"^  +  a44^7 

9S  av/(«l,/%A,vH.jj  V  9S  3v 


(“i'A'Vj’’’1) 


I^bI 


(al,/8j,Sk,vl_l'j 


K,\ 


(,  dv  dv  dv\ 

+  \  b2w+b3^  +  t>4^l 
\  dP  dS  dv/(ai,tij. 

_  ( dv  dv 

+  min  n  I - (1  —  6)S  — 

ne[0,A]  \da  d/3 


Sk,vi) 


,  dV  dv 

l%/l  +  min  m  I  +  (1  +  e)s^ 
me[0,l]  \  da  3/3 


l^ykll 


(a/./Sj.Sfe.u/) 


\Rijki\  =  0. 


(31) 


(ctj,Pj,Sk,vi) 


Given  a  positive  integer  N,  we  divide  the  time  interval  [0,  T]  into  N  sub-intervals  with  time  points  tn  =  n  x  At  for 
n  =  0,  1 , ,N,  where  At  =  T /N.  We  let  CAt  =  {0,  1,  2, . . . ,  N)  denote  the  index  set  of  the  time  mesh  points. 

We  now  approximate  the  1st  spatial  derivatives  in  (31)  by  finite-differences.  For  any  admissible  (n,  i,j,  k,  l),  we  denote 
by  V"kl  the  approximation  (to  be  determined)  the  solution  to  (31)  and  (20)  at  the  node  ( tn ,  a,-,  fy,  Sk,  v/).  Using  the  following 
finite  difference  operators 


ijkl 


n+vn  — 

Ua  V  ijkl  — 


n+vn 

US  ''ijkl 


n+vn  — 

uv  vijkl  ~ 


l/n+1  -  Vn 
vijkl  vijkl 

At 

V",  ~  VL 

hi 

Vn  —  vn 

vi(j+m  v  ijkl 

h2 

Vn  —  vn 

Vij(k+i)l  vijld 

h3 

V"  —  vn 

vijk(l+i)  vijkl 


vs,  -  v" 


D„  v; 


h4 


ijkl 


D-sK,k, 


DvV!jkl  = 


vn  —  vn 

v  ijkl  vi(j-\)kl 


V11  —  Vn 
vijkl  vij(k—\)l 


Vn  —  Vn 
v  ijkl  vijk(l- 1) 


D  v”  = 

#(*4)'  h 


—  vn 

r  V  iibl 


Vn  —  V 
vij(k+l)l  v  ijkl 


DsV",  ,,  : 


DvV 


vn  —  vn 

vmi+ 1)  vijkl 


Vn  —  vn 

vijkl  vij(k—1)l 


Vn  —  vn 
vijkl  vijk(l- 1) 


U'k(/+j)  h4  ’  V  i p(l-j )  h3 


we  propose  a  finite  difference  scheme  for  (31)  as  follows: 
-Dt 

v  ~  ■  Z  '1_r  2 

1 


+  2 


^hI^bI  -  lsl+i2  v‘D^k+ ,  JI*o-bI  -  2  (p^Vr')+D>P/l4,l 
^  (p^iv,)+d;v^|^h|  +  1  (p*St_i  v,)“d+v^|^.h| 

^  (p^v;+t)+D+V^|R|(d|  -  1  (p j? St v(+ 1 ) _  Dj  Vp, | Rp, | 

+  2  (^V’)+Ds  ^,1^1 


-  r, 


+ 


2  (p^Sfcv,_i) 


D?Vp(|Rp,|  +  (  °vV% 
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-  r(Pj)+D^kl\Rijk,\  -  r(Pj)-D^kl\Ri]kl\  +  Skv,  +  -pdSk  -  ,iSk  DJV^\Rm\ 


+  (  Skv ,  +  -pdSk  -  ,iSk  I  D^kl\Rijkl\  +  I  -p#v,  +  -r  +  $(n  -  v,)  )  DfV?kl\Rijkl\ 


+  \  2 


+ 


^p#v,  +  V  +  !(»?  -  v,)J  D+V^\Rijk,\  +  rrii(n,  i ,j,  k,  1)(-D+V?kl  +  (1  +  8)SkD~V \Rm\ 

"i(n,  i ,j,  k,  1)(d~V -  (1  -  9)SkD+V^\Rijkl\  =  0,  (32) 


where  (*)+  =  max{-,  0}  and  (•)  =  min{*,  0}  are  as  defined  before  and 


Vn  —  Vn  Vn  —  vn 

i  i\  -  l  v(i+l)jkl  vijkl  mc  vijkl  vi(j—\)kl 

m3(n,  i,j,  k,  I)  =  arg  min  ml - - - - — F  (1  +  8)Sk— - - - 


me[0,A.] 

ni(n,  ij,  k ,  /)  =  arg  min  hi 

ne[0,A]  \ 


•(- 


h  i 


)• 


Vn  —  Vn  Vn  —  vn 

vijkl  v(i—1)jkl  (  Vi(j+T)kl  vijkl 

I h  hi 


)■ 


(33) 


(34) 


Note  that  in  the  original  problem,  only  final/pay-off  conditions  are  fined  defined  in  (14)-(16).  However,  in  computation, 
we  need  to  define  some  artificial  boundary  conditions.  For  a  detailed  discussion  on  artificial  boundary  conditions,  we  refer 
to  [29],  Using  (20),  (17)  and  (18),  we  define  the  boundary  and  terminal  conditions  for  (32)-(34)  as  follows. 


{  U(Pj  +  sk(oti  -  0\ai\)  +  (Sk  -  K)+),  ( i,j ,  k,  l)  e  C„,  xWi  e  S2b, 

|0,  (; i,j,k,l )  e  Gh,xUJ<  $  Qb, 

vn  =  1 1/08/  +  Sk(ai  -  e\ai\)  +  (. Sk  -  I<)+),  (i ,j,  k,  l)  €  9G„,  xw<  €  Qb , 

jo,  (iJ,k,[)edCh,xij,k?S2b, 


(35) 

(36) 


for  n  =  0,  1, . . . ,  N  where  xijfc  =  (a,-,  fy,  Sk)  and  U(W)  is  the  utility  function. 

Let  A  =  (At,  h).  For  the  discretization  scheme  defined  in  (32)  we  have  the  following  theorem. 

Theorem  3.2.  For  any  A  >  (0,  0)  and  given  (n,  i,j ,  k,  l)  e  [0,  X]  and  n3(n,  i,j,  k ,  I)  e  [0,  A.],  the  system  matrix  of  (32)  is 

an  M-matrix  and  the  solution  of  (32)  is  bounded  uniformly  on  Gk  x  GAt- 


Proof.  We  first  rewrite  (32)  in  the  following  equivalent  form: 

i/n+1  —  vn  i  i/n  _  vn 

-  ,]\f  - — M2/i4 

At  2  fc+2  h3 


2 


_L  Vn  —  \/n  1 

^ - ^1^4 

h4  2 


(p#Sfc+iv,) 


\/n  —  vn 

vijkl  vijk(l- 1) 


hih2h4 


K2  Vvm-vm-riihhh  .  - 

+  - ^4 +2 


^  (piisk_,  y,) 


4.  I/'1  _  l/n 

+  vij  Id  vijk(l-l ) 


h\h2h4 


1 

+  2 


(pi7Sk_,v,) 
(pi7Skv|+,) 


l/n  _  Un 

Vijk(l+i)  vijkl 


x  h\h2h4  -  - 


2  (p^vi+  i) 


_1_  Un  _  un 
+  vy(fc+l)/  Vijkl 


un  _  yn  ,  wn  _  i ,n 

vijkl  vij(k—l)l  u  u  u  1,q.2,1  Vijk(l+ 1)  VWU  u  u 

- - hih2il3  ~  -"kVl+ 1 - r - 

tl  3  Z  2  1I4 


_i_  un  _  un  1 

X  h,h2h3  -  - 

h3  2 


+  2 

+  l^1Vm~^m~l)h1h2h3  -  r(Pj)+Vi(i+1>k'~Viiklh,h2h3h4 


Vn  -  vn 

vij(k+\)l  vijkl 


h,h2h3 


h\h2h3 


+  r\(Pj)-\Vm  Vi°'  mh,h2h3h4  +  (skv,  +  \pVSk  -  tiSk)  Viikl  Vjj(*  V>lhih2h3h4 
h2  V  2  /  /i3 


SkVi  +  ^p8Sk  -  p.Sk 


Vn  -  vn 

v  ij(k+\)l  vijkl 


h\h2h3h4 


1 


1 


+  (  -pi?U  +  -i?2  -!(>?-  v,)j  ,}ld  ^  1111(1  11  h3h2h3h4 
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1 


1 


-p&vi  + -V  -?( rj-Vt ) 


Vn  -  Vn 
Vijk(l+ 1)  Vijkl 


lljh2h3h4 


+  m. 


+ 


(n,i,j,k,  /)(- 
ni(n,  i,j,  k,  /)( 


Vdjw  V  +  (1  +  ^  m)h,h2h3h4 


h  i 


l/n  _  1/"  1/n  _  1/n 

VW  v(i-t)jkl  _  (1  _  iO'+l)W  Vijkl 


hi 


h2 


h\h2h3h4  =  0. 


Multiplying  (37)  by 


At 


hth2h3h4 


and  rearranging  the  resulting  equation,  we  have  the  following  system: 


V 


n+l 


pi 


VS 


ijkl 


1,  At  1  1/  \|  At  1,  At  1  I  /  \i 

1  +  2 S«^V,lt  +  2  \(P^Sk+12lv\h^:  +  2 Sk-lVlhi  +  2  K^Sk-iV')l 


+  2 


^|(p^Vi+t)| 


At  1 


h3h. 


+  ~&k  vi+  ’  tt  +  ~ 


SkV ,  +  -p$Sk  -  nSk 


3n4 

At  1 
2 
At 


At 

h3h4 


1/  \|  At  1  ,  At 


hi 


+ 


At 

+  r\(Pi)\-r  + 

h2 

At  At 

+  m1(n,  ij,  k,  l )- - b  m}(n,  ij,  k,  /)( 1  +  0)Sk— 

hi  h2 

At  A  t~ 

+  n,(n,  i,j,  k,  [)—  +  n,(n,  i,j,  k,  1)(  1  -  0)Sk  — 
hi  h2 


\p®v i  +  \ -?('?“  vi) 


4 

At 

h4 


—  vn 

(>+l)j kl 


—  Vn 
vnj+m 


—  vn 

Vi(j—L)kl 


_ wn 

vm+i  u 


At' 

m,(n,  i,j,  k,  /)  — 
hi  . 


—  vn 
V(i-L)jkl 


At' 

n,(n,  i,j,  k,  /)  — 
hi  . 


At 


[r(/Sj)+  +  n,(n,  i,  j,  k,  l)(  1  -  0)Sk]  — 

. 

[r\(Pj)~\  +  mi (n,  i,  j,  k,  0(1  +  0)S*]^ 

. 


1  ,  At  1 

2Si<+iV'^  +  2 


(p&Skv^y^t4+i 


+ 


^S/(V|  +  ^pPS,(  -  /xSk 


At 

h3 


_ un 

Vm-Di 


l 


At  1 


+ 


(^SkV'+i)  ^  + 


At 

h3h4 


+  At 
h3h4 


,  1  v  At 

+  (  Skv,  +  -p$Sk  -  fiSkj  — 


—  vn 

vmi+ 1) 


1  /  \+  At  1  /  \ 

2(P^^V')  /Wu  +  2  (P!?V2y') 


+ 


1 


1 


-pi?v,  +  -0  -  £(»)  -  V|) 


At 


—  v" 

vijk(l-l) 


Kp)  +  I (fos.-.i'i) 


At  1  9  At 

- +  —  ft  j,  Vi,  i  — z- 

h3h4  2  k  l+2  h24 


+  At  1  ,  At 

- +  -#kv,  i  — 

h3h4  2  “J  h4 


,1  1  ,  \+  At 

+  (-P0v,+  -i7  - 


(37) 


(38) 


Noting  that  the  right-hand  side  of  (38)  contains  no  more  than  nine  non-zero  terms,  the  coefficient  matrix  of  the  system  is 
septa-diagonal.  Introduce  an  index  transformation  q  =  q(i,j,  k,  1)  fori  =  1, . . . ,  E—  l,j  =  1, . . . ,  M—  1,  k  =  1, . . . ,  P—1  and 
/  =  1,  ,.Z  —  1  such  that  all  the  interior  nodes  of  the  mesh,  i.e„  those  having  indices  in  G/,,  are  re-ordered  consecutively  in  such 
away thatq(l,  1,1,1)  =  1, q(2,  1,  1,  1)  =  2, . . . ,  q(E- 1,  M- 1,  P- 1, Z- 1)  =  (£-1)  x  (M- 1)  x  (P- 1)  x  (Z-l)  =:  Q. 
Let 


w2(q(i,j ,  k,  /))  =  m,(n,  ij,  k,  1) 


At 


w3(q(i,j,  k,  [))  =  n,(n,  i,j,  k,  l) 


At 
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Wn4 (q(i,j,  k,  /))  =  [r(/3j)+  +  nr(n,  i,j,  k,  J)(l 


0)Sk] 


At 

h2  ’ 


w"(q(i,j,  k,  /))  =  \  r\(fij)  |  +  m1(n,  ij,  k,  0(1  +  9)Sk] 


At 


»S(9(Ufc,0)  =  ^|  +  l 


(.pMkv<+i)+  1  (p^v'-0 


+ 

i 


i 


Skvi  +  ^p9Sk  -  ptSk 


At 

h3 


At  1 


w"(q(i,j,  k,  0)  =  -S,  ,  vi-y  +  - 


(^m) 


At 

h3h4 


+  At 

h3h4 


,  1  \  At 

+  (  Skvi  +  -p$Sk  -  ptSkj 


Wg(q(i,j,  k,  0)  =  ^  (p&Sk+tv^ 


\+  At  1 

/  n  \~ 

/  Mu  +  2 

[p9Sk_iv,J 

At  1  At 

-  +  —  fit-  l^Ji  1  - 

h3h4  2  k  l+2  h\ 


Wn(q(i,j,k,  0)  = 


1 


1 


-ptfv,+  -tf  —  f  (?7  —  V,) 


At 
h4  ’ 


(P^V|)  ^  +  2(^-r') 


+  At  1 


+ 


At 


k  ‘'l-i 


,1  1  ,  \+  At 

+  (  -p^v,+ 


It  is  clear  that 


w"(q(i,j,k,l))>0  1  =  2,3, 

(39) 

9 

w"(q(ij ,  k.  /))  =  1  +  W?(q(i,j,  k,  l ))  >  1. 

(40) 

1=2 


Using  the  above  notation,  we  can  write  (38)  as  the  following  form: 

vw'  =  Vi"kiwKcl(i’j’  k-  0)  -  Vq+DW w2n(<J(U,  k'  0)  -  V]‘i_mw$(qQ,j,  k,  0) 

-  VT0 +m<w,j ,  k.  0)  -  V^jW^qiUh  k,  0)  -  V^k+l)lwn6(q(i,j,  k,  1 )) 

-  Vij(k-Diw"^’j’  k’  '))  ~  v,M+VwZti(i’j’  k,  0)  -  Vijkq-i)W%(q(i,j,  k,  l))  (41) 

for  (i,  j,  k,  l )  e  Gh. 

Casting  the  terms  associated  with  Dirichlet  boundary  points  to  the  LHS  and  swapping  the  LHS  and  RHS  of  the  resulting 
system,  we  see  that  (41)  can  be  rewritten  in  the  following  matrix  form: 


AnVn  =  £>"  +  cn+1 

for  n  =  N  —  1,  N  —  2, . . . ,  0,  where  An  =  (a^)®  t  is  a  septa-diagonal  matrix  with  the  non-zero  entries  given  by 


(42) 


aqq  =  W"(q),  <„+l  =  -W2O?)’  <,+  (£-1)  =  -W4(q),  (43) 

a?,?+(£-i)x(M-i)  =  ~w6(q),  \ij+(£— dxim— Dx(P— 1)  =  —  w8(q),  (44) 

<q-l  =  -<(q).  <„-(£-!)  =  -M  ’s(q)'  (45) 

^q,c/— (£— l)x(M— 2)  =  ~W7(Q)’  aq.ij-(E-l)x(M-l)x(P-l)  =  ~wg(Q^-  (46) 


for  q  =  1,2 , ...  ,Q_,bn  and  cn+1  are  Q.  x  1  column  vectors  representing,  respectively,  the  contribution  from  the  Dirichlet 
boundary  conditions  at  the  nth  time  step  and  the  left-hand  side  of  (41)  involving  the  approximate  solution  at  the  (n  +  l)th 
time  step,  and  Vn  denotes  the  unknown  vector.  This  is  a  Q  x  Q  linear  system  in  Vn.  From  (39)  and  (40),  (43)-(46)  it  is  easy 
to  see  that  An  is  diagonally  dominant,  irreducible  and  has  positive  diagonal  and  non-positive  off-diagonal  elements.  Thus, 
An  is  an  M-matrix  (cf.,  for  example,  [30])  and  is  non-singular.  Therefore,  we  conclude  that  there  exists  a  unique  solution  to 
(42 )/(32)  with  given  rrij (n,  i,j,  k,  l )  e  [0,  A]  and  n,(n,  i,j ,  k,  l )  e  [0,  A.]. 
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We  next  show  that  for  any  A  >  (0,  0),  the  solution  is  uniformly  bounded.  Our  strategy  is  to  prove  that  if  the  terminal 
conditions  satisfy 

max  1 1/^,1  <  C  <  +oo 
t i,j,k,l)ech  m 


for  some  positive  constant  C,  then 

r  max  IV'p,!  <  C.  n  =  l,2,..,N— 1. 

(i,j,k,l)eGh 

By  (35)-(36)  and  (6),  we  have 
max  |Vp,|  <  1. 

DeGh 


(47) 


Let  maX(fjjkj/)€G/I  |  Vjjkl |  <  1  hold  for  an  n  <  N,  we  will  show  that  ma XQjxijeCh  I  V]jkl  1 1  <  1  by  contradiction, 
Suppose  that  max^^g^  |  V^1 1  >  1-  Then,  there  exists  an  index  triple  (i0,  j0,  k0,  lo)  €  G/,,  such  that 

I'Cwol^1  and  \vw'\  <  IV^I,  V(i U,k,l)ech. 

Combining  this  with  (41),  we  have 

9 


IV".  ,  ,  I  > 

1  lQJol<olo  1  — 


ioio^oh 


1  +  I]u’p  ](9o) 
P=  2 


I VGo+ii/ofcolo ’(9o)l  l'/("0-l)j0k0/0lt'3  1((Io)l  l^oOo+lJko'o^  1(<?o)l 


-  - \V& 

-  Kj:mo+^r(<io)\  -  \v^Mh_vw"9-\qo)\ 

=  I^Wol  +  l^-1«?o)l(|Vi-Jo,o|  -  |V(;-+11,okolol)  +  K1(qo)l(|V£-^l  -  IVST-^J) 

+  K-1(qo)l(|V7i"J0,0|  -  | -  l^ioC/o— 1,fe0/0D 

+  I<-1«?0)I(|V,"0-(;(0|  -  Kj0\k0+i)l0\)  +  \wr\c,MKj0l0l0\ - 
+  \wV«i°)\(Kjoh\  -  K~0lo0o+v\)  +  \wV(qo)\(Kj0lak\ - 
*  l^bl 

>  i, 

where  q0  =  q(io,jo,  l<o,  h )■  Clearly,  this  contradicts  our  assumption  that  max^^g^  \V]jkl\  <  1.  Thus,  we  have 

maX(ij  fc  |)gCfl  |  V^y1 1  <  1.  By  the  mathematical  induction  principle,  (47)  holds  and  the  theorem  is  proved.  □ 

To  conclude  this  section,  we  comment  since  the  system  matrix  of  (32)  is  an  M-matrix  by  Theorem  3.2  the  discretization 
is  monotone  which  guarantees  that  the  solution  to  (32)  is  non-negative  since  the  boundary  conditions  (35)-(36)  are  non¬ 
negative. 

4.  Decoupling  algorithm  and  its  convergence 

In  this  section  we  present  a  decoupling  algorithm  for  solving  the  nonlinear  system  (42)/(32)-(34).  We  first  rewrite  (37), 
which  is  equivalent  to  (42)  and  (32),  as  the  following  form: 


£\vm  +  min  m^2Vm  +  min  nX3Viiki  =  0 
1  J  me[0,X]  2  J  ne[(U]  J  J 

for  n  =  N  —  1,  N  —  2, . . . ,  0  and  ( i,j ,  k,  l)  e  G/,,  where  A  =  (At,  h)  as  defined  previously  and 
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(48) 
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We  propose  the  following  algorithm  for  (42). 


(49) 

(50) 

(51) 


Algorithm  D.  1.  Initialize  V^kl  for  all  (i,j,  k,  l)  e  G/,  using  the  terminal  and  boundary  conditions  (35 )— (36)  and  let  n  =  N  —  1. 

2.  Let  =  Vyk ['  for  all  ( ij ,  k,  1)  e  Gk  and  evaluate 


m°(n,i,j,k,l)  =  arg  (  min  mXfV'l+A  , 

\me[0,A]  1  J 

n°(n,  i,j,  k,  l)  =  arg  (  min  h£tvm  ) 

\ns[0,A]  lJla  ) 


for  any  (i,  j,  k,  l)  e  G/,. 

3.  For  a  given  tolerance  e  >  0,  setp  =  0. 

4.  Solve,  V(i,  j,  k,  l)  e  G/,,  the  following  system  along  with  the  boundary  conditions  (36)  for  {Vpf+1  l(ij,u)ec,G 
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where,  when  applied  to  Vpf+\  the  finite  difference  operator  is 
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5.  Evaluate,  for  all  (i,  j,  k,  l )  €  G/,, 
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(52) 


(53) 


(54) 

(55) 


6.  If  maX(jjifc|)6Cj]  |Vn’p+1(i,  j,  k,  l )  —  Vn’p(i,j,  k,  1)1  >  e,  setp  =  p  +  1  and  goto  Step  4.  Otherwise,  goto  Step  7. 

7.  Set  Vp,  =  Vp(p+1  for  (ij,  k,  l)  e  Gh  and  m^(n,  i,j,  k,  l)  =  m4+1  (n,  i,j,  k,  I),  n4(n,  i,j,  k,  I)  =  np+1(n,  i,j,  k,  l)  for 
(i,  j,  k,  1)  e  G/,.  If  n  =  0,  stop.  Otherwise,  let  n  =  n  —  1  and  goto  Step  2. 


Clearly,  in  Algorithm  D,  the  nonlinear  system  (32)— (34)  is  decoupled  so  that  in  each  iteration  we  only  solve  the  linear 
system  (52). 

Using  the  notation  used  in  the  proof  of  Theorem  3.2,  we  let  Vn,p+1  denote  the  solution  to  (52).  Then,  the  convergence  of 
the  iterative  algorithm  in  Algorithm  D  is  given  in  the  following  theorem. 
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Theorem  4.1.  The  iterative  scheme  (52)-(55)  generates  a  sequence  {Vn,p}£l0  that  converges  to  the  solution  of  (48)  with  the 
terminal  and  boundary  conditions  (35)-(36). 

Proof.  We  will  use  the  notation  in  Algorithm  D.  To  prove  this  theorem,  we  first  show  that  the  sequence  {Vnp}^f0  generated 
by  the  iterative  method  is  monotonically  increasing,  i.e.,  Vn'p  <  Vn,p+1  forp  >  1. 

From  (52)  we  have 


for  p  =  1,2, _ This  can  be  rewritten  as 


(56) 


since,  by  (54)  and  (55), 


Note  that  both  (52)  and  (56)  have  the  same  boundary  conditions,  i.e.,  =  V'l^f+I  for  any  (i,j,  k,  l)  e  3 G/,.  Thus,  using  the 

notation  in  the  proof  of  Theorem  3.2,  we  may  write  (52)  and  (56)  as  the  following  respective  matrix  forms  similar  to  (42): 

An,pvn.p+1  =b"  +  cn+l  and  A".Py».P  <(,»  + 

where  Anp,  b"  and  cn+1  are  as  defined  in  (42)  with  Anp  an  M-matrix.  Therefore,  we  have 

A"’P(V n.p+l  _  yn.pj  >  q 

Since  An,p  is  an  M-matrix,  we  have 

yn,p+l  _  yn,p  >  Q 

Therefore,  the  monotonicity  of  iteration  process  is  proved. 

From  Theorem  3.2  we  have  that  Vn'p  is  bounded  for  any  p  =  0,  1.2,....  Combining  the  monotonicity  and  boundedness 
of  Vn,p  we  see  that  Vn,p  is  convergent.  Finally,  from  the  construction  of  (52)  and  (55)  it  is  obvious  that  V^f,  m^n,  i,  j,  k,  l) 
and  rij(n,  ij,  k,  /)  solve  (48)  when  p  ->  oo.  Thus,  we  have  proved  the  theorem.  □ 

5.  Numerical  results 

In  this  section,  we  use  the  scheme  (32)-(36)  to  calculate  the  value  functions  Vl(i  =  0,  b,  w)  and  present  the  computed 
reservation  purchase  and  write  prices  of  a  European  call  option  by  using  the  utility  function  in  (6).  Note  that  using  this  utility 
function  can  eliminate  the  bond  account  variable,  yS,  in  ( 12 )  by  a  transformation.  Flowever,  in  this  paper,  we  will  implement 
our  schemes  without  eliminating  /3  to  demonstrate  our  algorithm  can  also  be  used  for  other  types  of  utility  functions. 

We  now  illustrate  the  performance  and  usefulness  of  the  scheme  using  the  following  test  example: 

Test  Example:  Reservation  purchase  and  write  prices  of  a  European  call  option  with  expiry  date  T  =  0.6,  the  initial  price  of 
the  underlying  stock  S0  =  1.6,  the  initial  value  of  volatility  v0  =  1.6,  the  risk  aversion  parameter}/  =  1  with  various  values 
ofa0,  Po  and  strike  price  K.  Other  parameters  are:  r  =  0.05,  pi  =  0.1 ,9  =  0.005,  f  =  5,  p  =  0.16,  p  =  0.1  and  iT  =  0.9 
To  solve  the  problem,  we  choose  =  Lp  =  \,La  =  3 ,Lp  =  5andLs  =LV  =  3  in  (24).  The  mesh  and  penalty  parameters 
are  chosen  to  be  E  =  20,  M  =  30,  P  =  Z  =  15,  N  =  15  and  X  =  1000.  Other  parameters  are  At  =  0.04,  h  =  0.2.  Using  the 
numerical  solution  on  this  mesh,  we  examine  the  changes  of  reservation  purchase  and  write  prices  with  respect  to  different 
variables. 

We  first  examine  the  influence  of/io  on  reservation  prices.  For  a0  =  2andK  =  1.6,  we  compute  the  reservation  purchase 
and  write  prices  with  various  values  of  /S0  and  the  computed  results  are  plotted  in  Fig.  1.  It  is  clear  from  Fig.  1  that  the  initial 
holding  in  the  bond  does  not  affect  both  the  reservation  purchase  and  write  prices.  This  coincides  with  the  results  in  [19], 
As  mentioned  before,  using  an  exponential  utility  function  can  eliminate  the  bond  account  variable  ft  by  a  transformation. 
Thus,  the  resulting  option  prices  are  independent  of  /30. 

To  examine  the  influence  of  a0  on  the  reservation  prices,  we  compute  the  purchase  and  write  prices  for  a0  =  1 .6,  2,  2.4 
using  our  numerical  method.  Since  the  results  are  independent  of  /30,  we  only  plot  the  computed  purchase  and  write  prices 
Pi,  and  Pw  against  a 0  in  Fig.  2.  From  Fig.  2  we  have  the  following  observations: 

1.  The  write  prices  first  decrease  as  a0  increase  and  then  tend  to  be  stable,  and 

2.  the  reservation  write  price  is  always  higher  than  the  purchase  price. 


DISTRIBUTION  A.  Approved  for  public  release:  distribution  unlimited. 


W.  Li,  S.  Wang /  Computers  and  Mathematics  with  Applications  73(2017)  2454-2469 


2467 


Fig.  1.  Computed  reservation  prices  for  ao  =  2,  K  =  1.6  and  p  =  0.1. 


Fig.  2.  Computed  reservation  prices  for  a0  =  1.6,2  and  2.4. 


These  observations  are  financially  correct  because  of  the  following  reasons.  It  is  known  that  the  utility  based  option  pricing 
approach  values  an  option  from  both  buyers’  and  sellers’  perspectives,  which  leads  to  two  prices,  i.e.,  reservation  write 
and  purchase  prices.  The  reservation  write  and  purchase  prices  are  respectively  the  prices  at  which  the  investor  is  willing 
to  sell  and  the  investor  is  willing  to  purchase  the  option.  Clearly,  the  write  price  is  greater  than  the  purchase  price  as  the 
price  that  the  seller  wants  to  receive  is  always  higher  than  that  the  buyer  wants  to  pay.  Also,  according  to  the  theory  of 
supply  and  demand,  the  more  stock  a  writer  (respectively  purchaser)  holds,  the  more  (respectively  less)  he/she  wants  to 
sell  (respectively  purchase)  the  stock  and  therefore  he/she  will  reduce  the  option  price.  However,  when  the  price  reaches  a 
certain  level,  it  will  not  be  reduced  further. 

Finally,  we  consider  the  influence  of  the  strike  price  I<  on  the  reservation  prices.  To  achieve  this,  we  assume  that  a0  =  2 
and  compute  the  purchase  and  write  prices  for  K  =  1,1 .6  and  2.4.  Again,  since  the  computed  prices  are  independent  of  fi0, 
we  plot  them  against  K  in  Fig.  3  from  which  it  is  easily  seen  that  both  the  purchase  and  write  prices  decrease  as  K  increases. 
The  explanation  for  this  phenomenon  is  as  follows.  If  ST  >  K,  the  option  holder  (buyer)  will  exercise  the  option  at  the  expiry 
date  T.  Thus,  the  buyer  will  earn  Sj  —  I<  and  the  writer  will  lost  the  same  amount.  Since  the  gain/loss  Sr  —  K  is  a  decreasing 
functions  of  K,  when  K  increases,  both  buyer  and  writer  will  reduce  the  option  price. 

We  remark  that  the  original  problem  is  defined  on  an  infinite  domain  and  does  not  have  any  Dirichlet  boundary 
conditions.  In  the  paper,  we  define  an  artificial  (homogeneous)  Dirichlet  boundary  condition  on  each  of  the  boundary 
segments  as  the  exact  one  is  unknown.  The  computational  errors  caused  by  the  above  artificial  boundary  condition  are 
essentially  located  in  the  boundary  layer,  as  shown  in  [29],  Thus,  in  the  numerical  results  presented  above,  we  only  plot  the 
computed  values  at  the  mesh  points  which  are  some  distance  away  from  the  boundary  segments. 
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Fig.  3.  Computed  reservation  prices  for  K  =  1,  1.6  and  2.4. 


6.  Conclusion 

In  this  paper  we  propose  a  penalty  method  combined  with  a  finite  volume  scheme  to  solve  the  HJB  equation  in  4  spatial 
dimensions  governing  the  reservation  purchase  and  write  prices  of  a  European  call  option  with  proportional  transaction 
costs  and  stochastic  volatility.  This  scheme  has  the  merits  that  it  is  easy  to  implement  and  the  resulting  system  matrix  is 
an  M-matrix.  The  latter  guarantees  that  numerical  solutions  from  discretization  method  are  always  non-negative  when  the 
boundary  and  payoff  conditions  are  non-negative.  The  numerical  results  showed  that  the  method  is  able  to  solve  problems 
of  practical  significance. 
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ANALYZING  HUMAN  PERIODIC  WALKING  AT  DIFFERENT 
SPEEDS  USING  PARAMETERIZATION  ENHANCING 
TRANSFORM  IN  DYNAMIC  OPTIMIZATION 


Meiyi  Tan,  Leslie  S.  Jennings  and  Song  Wang* 


Abstract:  In  this  paper,  we  extend  the  human  walking  model  proposed  in  [18]  to  improve  periodic  motion 
and  to  explore  different  walking  speeds.  We  first  propose  the  inclusion  of  additional  constraints  to  better 
model  the  periodic  motion  of  a  human.  We  then  introduce  the  use  of  Control  Parameterization  Enhancing 
Transform  (CPET)  technique  within  the  model,  along  with  the  inclusion  of  velocity  in  the  objective  function, 
to  allow  different  walking  speeds  to  be  used  in  the  model.  Numerical  experiments,  performed  to  show  the 
superiority  of  this  new  model,  show  that  periodic  motion  can  be  improved  with  the  addition  of  periodicity 
constraints  and  that  human  walking  motions  can  be  replicated  at  faster  or  slower  velocities,  which  is  desirable 
in  practice.  The  numerical  results  also  show  that  the  time  interval  in  the  single  support  and  double  support 
phases  can  be  optimized  using  CPET  for  different  walking  speeds. 


Key  words:  optimal  control  of  robot  motion ,  dynamic  optimization ,  biped  locomotion,  robot  modelling  and 
optimization. 

Mathematics  Subject  Classification:  f9M30,  90C90,  93BfO. 


1 


Introduction 


Walking,  though  classified  as  being  the  most  basic  human  motion,  is  deemed  to  involve  great 
complexities  and  thus  should  be  given  significant  attention.  It  is  considered  one  of  the  most 
complicated  motions  involving  a  series  of  complex  continuous  and  discontinuous  phases  [22] . 
Modelling  and  optimization  of  human  walking  are  key  to  robot  design  and  application  in 
industries  such  as  manufacturing  and  health  care.  Human  locomotion  has  attracted  much 
attention  from  researchers  and  practitioners  since  1970s  and  many  of  these  results  have 
been  fundamental  for  advancement  in  the  development  of  support  for  multiple  movement 
disabilities  [4].  The  greater  understanding  of  the  mechanics  behind  walking  can  contribute 
to  the  ability  to  improve  aids  for  people  with  locomotor  disabilities,  and  the  development  of 
walking  robots  etc. 

There  has  been  a  significant  increase  in  research  on  human  walking  in  the  past  decade. 
With  the  advancement  of  computing  power,  simulation  of  human  walking  governed  by  com¬ 
plex  equations  has  now  become  possible.  As  demonstrated  by  existing  studies,  research 
based  on  modelling  and  simulation  contributes  more  significantly  to  the  study  of  human 
locomotion  than  empirical  studies,  since  it  is  able  to  investigate  in  more  detail  the  muscle 

*S.  Wang’s  work  was  partially  supported  by  the  AOARD  Project  #15IOA095  from  the  US  Air  Force. 
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activities  involved  in  the  movement  patterns,  and  can  also  lead  to  the  accurate  prediction  of 
motion  ([14,  15]).  Amongst  many  research  activities  done  on  human  walking,  only  a  handful 
of  them  was  on  modelling  and  simulating  the  dynamics  of  human  walking  ([14,  16]). 

The  purpose  of  this  study  is  to  extend  the  model  in  [18]  to  one  that  improves  periodic 
motion  and  allows  different  walking  speeds  in  optimization.  These  are  achieved  by  re¬ 
design  of  the  constraints,  introduction  of  velocity  in  the  objective  function  and  the  use  of 
the  Control  Parameterization  Enhancing  Transform  (CPET)  technique  [21]  in  the  optimal 
control  model.  In  the  current  model,  walking  speeds  are  adjustable  and  can  be  computed  so 
that  an  evaluation  of  the  dynamics  involved  can  be  explored.  The  study  provides  insights  to 
factors  which  are  important  in  walking  and  can  aid  in  future  development  of  more  realistic 
walking  robots  or  aids  for  the  locomotory  handicapped. 

Various  algorithms  are  available  to  solve  such  problems  and  have  been  reviewed  in  opti¬ 
mization  software  guides  [13].  However,  many  of  these  algorithms  were  not  developed  into 
general  purpose  software  packages  to  solve  optimal  control  problems  that  includes  complex 
constraints.  Software  packages  such  as  MATLAB  or  GAMS  (which  runs  on  MATLAB ’s 
platform)  [3]  may  be  capable  of  solving  such  problems,  but  are  computationally  slow.  For 
these  reasons,  the  MISER3  [8]  optimal  control  software  was  chosen.  The  central  idea  behind 
the  software  MISER3  is  the  concept  of  control  parametrisation  [20],  which  is  used  to  ap¬ 
proximate  the  optimal  control  problem  by  a  constrained  non-linear  programming  problem. 
This  software  is  backed  by  several  theoretical  advancements  over  earlier  versions,  such  as 
superior  technique  for  handling  continuous  state  inequality  constraints  [19]  and  its  ability 
to  solve  Koh’s  study  ([9],  [10],  [11])  on  optimising  performance  for  the  Yurchenko  layout 
vault,  a  complex  optimal  control  problem.  In  addition,  it  includes  software  hooks  to  four 
optimization  algorithms  namely,  FSQP,  NLPQL,  NPSOL  and  NLPQLP.  All  four  algorithms 
use  a  sequential  quadratic  programming  algorithm  which  is  recognised  as  the  most  efficient 
algorithm  for  small  and  medium  size  non- linearly  constrained  optimization  problem  [8]. 
MATLAB  based  systems  for  example,  [5],  which  became  available  after  the  start  of  this 
project  were  considered  too  slow  even  though  the  literature  suggests  good  performance. 

The  rest  of  this  paper  is  organised  as  follows.  In  the  next  section,  we  introduce  the 
geometry  used  for  the  paper.  In  Section  3,  we  propose  new  constraints,  different  from  what 
was  explored  in  Tan  ct  al.  [18],  introduce  the  CPET  technique  and  velocity  decision  variable 
within  the  objective  function.  The  introduction  of  the  mention  will  be  used  to  sought  better 
periodicity  and  different  walking  speeds.  Numerical  experimental  results  will  be  presented 
to  demonstrate  that  this  modified  model,  as  compared  to  the  model  seen  in  Tan  et  al.  18], 
improves  periodic  motion  and  allow  analysis  of  different  walking  speeds,  and  hence  provide 
a  more  realistic  tool  for  human  walking  modelling. 


2 


The  Model 


2.1 


Geometry 


A  link  segment  model  was  used  to  represent  the  human  body  as  depicted  in  Figure  2.1  in 
which  there  are  n  =  7  segments.  For  each  i  =  1, . . .  ,n,  the  ith  segment  has  length  li,  mass 
m,  and  moment  of  inertia  A  about  its  center  of  mass  (CoM) (within  the  segment),  which  is 
a  distance  r;  from  proximal  (xf,  j/f )  and  distance  I*  —  n  from  the  distal  (xf,  yf)  end  (Figure 
2.2).  The  link  segment  model  is  that  used  in  Tan  et  al.’s  [18]. 

Each  segment’s  CoM  position  is  defined  by  (xl,  yi),  and  angle  0i,  where  0;  is  determined 
by  angle  that  segment  makes  with  the  positive  a:-axis.  The  CoM  of  the  whole  body,  ( X ,  Y), 
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is  given  by 


'  X 

1 

sr~\n 

/  j\— \ 

Y 

~  M 

\—\n 

[  2^i=imiVi  \ 

n 

where  M  =  rrii . 

i= 1 


Waft  Dmxin 


Figure  2.1:  Seven-segment  model  Figure  2.2:  An  ith  segment  diagram 


The  positional  equations  for  a  chain  of  segments  (see  Figure  2.1)  are 

x  —  x\e  +  LDce,  y  —  y^e  +  LDse , 

where  x  =  (aq, . . .  ,!„)*,  y  =  (yi, . . .  ,ynY,  e  =  (1, 1. . . . ,  l)4, 
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The  CoM  of  the  whole  system  is  written  in  matrix-vector  form  as  MX  =  mtx  and 
MY  =  mty,  where  mt  =  (mi, . . .  ,mn).  Hence  the  relations,  using  mde  =  M. 

MX  =  mtx  =  Mx  j  +  mtLDce  and  MY  =  mty  =  My\  +  mtLDse. 

The  distal  end  of  Segment  G  of  the  chain  of  segments,  has  co-ordinates 

6  6 

Xg  =  x\  +  ^2  h  cos  Oi  and  vi  =  Vi+^2,  !j‘  sin 

i=l  i= 1 
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2.2 


The  Topology 


The  proximal  incidence  matrix  is  a  j  x  n  matrix  Ap  where 


Apki  = 


if  segment  i  has  proximal  end  at  joint  k, 
otherwise. 


The  distal  incidence  matrix  Ad  is  similarly  defined. 


Ad  - 
Aki  ~ 


1,  if  segment  i  has  distal  end  at  joint  k , 

0,  otherwise. 

The  joint-external  contact  incidence  matrix  B  (j  x  e)  is  defined  as 


Bki  = 


1.  if  joint  k  contacts  the  ground  at  external  contact  i, 
0.  otherwise. 


where  e  is  the  number  of  external  contacts. 

Forces  which  supply  the  rotational  and  translational  motions  to  segment  i  and  are  given 

by 


ft  = 


£CX 

Jk 


fey 

Jk 


/r 

ir 


ft  = 


ft  = 

The  translational  equations  can  be  written  as 

—mil  i  +  Jvu>  +  Sfx  =  —Jxu>2, 

-mill  —  Jxu>  +  Sfv  =  gm  —  Jyu> 2, 
where  proximal,  distal  and  external  forces  are  ordered  such  that 


ft 

ft 


fpx 

r  fpy  i 

t  = 

r 

fdx 

fv  = 

1 

Pi 

_ 1 

We  have  defined:  6  =  {9 1, . . . ,  0n ) L .  ui  =  6,  u>  =  6.  u2  =  (cof, . . . ,  S  =  [/„,  0,  In\. 
The  moment,  equation  can  similarly  be  expressed  in  matrix  form,  with  vector  r  being  a 
vector  of  the  proximal  torques  appropriately  ordered,  as 

Ju  +  Mx  fx  +  Mvf  =  Tt, 

where  J  =  diag(/i,  I2,  ■  ■  ■ ,  In), 

Mx  =  Ds  [~Dr  |  0  |  Dt-Dr], 

My  =  Dc  [  D,  |  0  |  -(Di-Dr)], 

Dr  =  diag(»’i,  r2, .  - .  ,rn),  Dt  =  diag  (hJ2,  ■  ■  -An)- 


The  matrix  T  with  a  torque  acting  between  Segment  6  and  the  external  world  is  given 
by 


segl 
scg2 
T  _  seg3 
seg4 
seg5 
seg6 
seg7 


T~1  T2  T3  T4 

-10  0  0 

1-10  0 

0  10  0 

0  0-10 
0  0  1-1 
0  0  0  1 

0  0  0  0 


T5  t6  t7 

0  0  0] 

0  0  0 

0-10] 

0  0  1] 

0  0  0] 

-10  0] 

0  1  -1 
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The  complete  equations  for  non-heel  contact  are: 

0  =  ui, 
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In  above  equation  we  used  double  square  brackets  to  differentiate  these  rows  from  the  others. 
Depending  on  the  cases  determined  by  the  phase  the  walking  motion  is  in,  some  of  these 
rows  are  not  needed.  For  more  details  on  the  cases  we  refer  to  [18].  The  two  row  vectors  f' 
and  I1,  in  the  above  equation  are  defined  as 


i l*  —  [ii>  h,  h,  h,  h,  Is,  0], 

ill  =  1  llDs, 
ill  =  1  lfDc, 

where  for  proximal  Segment  1 


or  2 1*  —  [0,  h,  h,  I4,  h,  Is,  0], 

or  2  ll  =  2 1*  Ds, 

or  2  ll 
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and  renaming  the  velocities  as  u  and  v,  scripted  appropriately, 

-1 IW  ' 

-i iW  . 

Similarly  for  distal  Segment  6  measured  from  proximal  Segment  2. 


iig 

'  -1  r.v 

4 

— 

4 

+ 

1 

+ 

3 


Numerical  Techniques 


The  simulation  model  (Experiment  Main),  that  was  explored  in  Tan  et  al.’s  [18]  paper, 
was  extended  to  conduct  three  other  experiments.  These  experiments  were  based  on  opti¬ 
mized  toques  obtained  from  Experiment  Main.  The  initial  study  presents  18  states  (a;  = 
[rci,  X2,  ■  ■  ■ ,  2Ji8]T),  15  system  parameters  (z  —  [zi,  z2, . . . ,  ^is]T),  and  7  controls  (r  — 
|t],t2....,t7|  1  )  namely  joint  torques,  were  set  up  in  MISER3.3  [8].  The  18  states  con¬ 
sists  of  the  angular  displacements  from  Segment  1  to  Segment  7  (xt  =  9i,i  =  1,...,7), 
angular  velocity  (sq  =  6 lti  =  8, ,  14),  coordinate  and  velocity  of  proximal  end  of  segment 
one,  ((xi5,xi6,xn,xia)  =  (x^,  x\,  Vi))-  The  system  parameter  consists  of  the  initial 

segment  angular  orientation  (#.;(0)  =  Zi,i  =  1,...,7),  initial  angular  velocity  at  start  of 
single  support  phase  (w;(0)  =  z7+i)  and  z\ 5  is  the  step  length  which  is  twice  the  distance  of 
initial  distance  ofa;f(0)  and  ccg(O)  and  hence  dependent  on  (zi, . . . ,  z6).  Variables  0i,02,#3, 
#4,  $5,  $6  describe  the  angular  displacements  of  the  legs  and  67  describes  the  angular  dis¬ 
placement  of  the  trunk  segment.  . . . ,  w7  are  the  segments’  corresponding  velocities;  and 
(x\,y\)  are  the  coordinates  of  the  proximal  end  of  segment  one  (toe  of  stance  foot)  which 
remains  stationary  on  the  ground  during  one  step  of  the  walk  cycle. 

Normal  walking  has  been  assumed  to  be  symmetric  and  cyclic  and  hence  only  one  step 
of  the  gait  cycle  needs  to  be  modelled  and  simulated.  Periodicity  conditions  are  required 
such  that  the  end  of  the  walk  cycle  is  identical  to  the  start  so  that  successive  steps  repeat 
the  motion  of  the  previous  step  by  swapping  the  roles  of  legs. 

Experiment  1  is  an  extension  from  the  main  experiment,  aimed  at  achieving  a  periodic 
motion.  The  objective  function  remained  the  same,  as  the  main  experiment, 

G0(t,z)  =  I  (CoMypos  —  CoMyinit)2  dt 

J  0 

with  the  following  state  equations: 

fi(t,x,u,z),  t  6  [0,Ti),a;(0)  =  X°{z), 
f2(t,x,u,z),  t  6  [Ti,Tf),x(Ti)  =  hi(x(Ti),z), 

where  Tj  (=  0.386s)  is  the  duration  of  the  single  support  phase,  Tj  (—  0.486s)  is  the  duration 
of  a  step  (single  support  and  double  support  phase)  and  h\(x(Tx  ),  z)  defines  the  new  states 
governing  the  start  of  double  support  phase.  CoMypos  is  the  center  of  mass  of  y-coordinate, 
a  function  of  9(r,  z ,  t),  and  CoMyinit  is  the  initial  center  of  mass  of  y-coordinate,  a  function 
of  z,  as  calculated  by  MISER3.3. 

The  objective  function  is  subject  to  constraints  in  the  canonical  form: 

gk(t,x(t),u(t),z)dt 

where  ngc  is  the  total  number  of  canonical  constraints,  and  t &  £  (0,  tf]  is  a  known  constant 
and  is  referred  to  as  the  ’characteristic  time’  associated  with  the  constraint  Gk[.  All-time 


^  0,  A.  —  1,  •  •  •  ,  TlgC} 


Gk(u,z)  =  (j>k(x(tk)z)) 


rtk 
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constraints  h(t,x,u,  z)  >  0  and  constraints  involving  system  parameters  gk(z)  as  well, 
are  converted  by  MISER3.3  to  canonical  constraints.  The  gradient  of  the  objective  and 
constraint  functions  are  automatically  calculated  by  MISER3.3  using  a  numerical  procedure. 
See  [8]  for  more  details. 

Experiments  1  follow  the  same  constraints  as  Experiment  Main  (See  [18]  for  more  details) 
with  the  exception  of  the  terminal  constraint  on  the  trunk  angular  displacement  at  (Tj  = 
0.486s) 


94  —  0,  M0(Tf),z)=z7-97(Tf)  =  0. 

which  is  now  replaced  with, 

•  a  terminal  constraint  at  the  end  of  the  step  cycle  (Tj  =  0.486s)  on  the  final  9(Tf), 
such  that  the  final  motion  resembles  the  first,  is  given  by, 

3 

94  =  o,  z)  =  -  0r-i(2»  +  3.14)2  + 

i=l 

5 

-  07-i(Tf)  -  3.14)2  +  (zr  -  f)7(Tf))2  =  0. 

i— 4 

Remark:  Once  again,  a  sum  of  squares  of  7  constraints  is  taken  to  reduce  complexity. 


Experiments  2  and  3  aimed  at  investigating  at  maximising  and  minimising  velocity  of 
normal  walking  and  observing  the  motion  and  joint  torques  involved  respectively.  This  was 
done  using  the  Control  Parametrisation  Enhancing  Transform  (CPET)  technique  for  con¬ 
strained  optimal  control  problems  in  MISER3  [21]  that  allowed  the  time  of  a  step  cycle  to 
be  optimized  and  adjusting  the  time  interval  of  single  support  and  double  support  accord¬ 
ingly.  CPET  is  a  technique  which  can  be  used  to  optimise  the  real  time  taken  for  a  set 
of  states  to  move  from  one  configuration  to  another  by  introducing  a  new  control  function 
us(f)  =  ^ .  modelled  as  a  piecewise  constant  control  function  on  user  chosen  knots,  in  this 
case  (£o, si) £3)  =  (0,Tj,Ty).  In  addition,  an  extra  state  function  aqg  is  added  to  the  state 
variables  with  different  equation, 


ds(t) 

dt 


(t), 


£19(0)  —  ZlQ. 


The  optimal  control  problem  is  redefined  with  19  states  (x  =  [a^aqg),  aqg]),  16  system 
parameters  (z  =  [z,zie]),  and  7  controls  (m  =  [u(xig),  us]).  For  experiment  2  and  3,  the 
revised  objective  function  is, 


G0{r,z)  =  I  us{t)(CoMypos  -  CoMyinuf  dt 

Jo 

with  refined  dynamics 

dx 
dt 


us{t)  f  J  *7  U,  z) 

us(t) 

U8{t)f2{xis(t),X,U,Z ) 
us{t) 


t  e  [0,4),x(0)  = 
t  <E  [4, 1))  x(tk)  = 


x°(z) 
z16 

hi{x{tk),z) 

Xl5(tk) 
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Figure  4.3:  Comparison  of  segment  angular  displacements  between  Expl  and  Exp  Main 


Figure  4.5  presents  the  external  forces  acting  on  the  ankle  during  double  support  phase. 
A  negative  external  vertical  force  is  observed  when  swing  heel  comes  into  contact  with  the 
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Figure  4.4:  Exp  1  versus  Exp  Main  -  CoM  displacement  and  velocity 


ground,  which  is  consistent  as  weight  of  the  body  is  being  distributed  from  the  stance  toe 
at  start  and  through  to  double  support  phase.  During  this  period,  a  positive  horizontal 
external  force  keeps  the  heel  in  position,  preventing  it  from  sliding  backwards.  When  the 
present  model  is  compared,  it  can  be  observed  that  there  is  none  or  only  slight  difference  in 
the  horizontal  external  force  on  the  ankle.  However,  in  contrast,  the  present  model  presents 
a  larger  negative  vertical  force  as  compared  to  Experiment  Main.  A  closer  look  at  the 
proximal  and  distal  y-forces  during  double  support  phase  suggest  that  greater  force  in  the 
y-direction  is  present  in  this  model  (Figures  4.6  and  4.7),  and  is  especially  evident  in  the 
forces  acting  on  the  swing  leg. 
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Figure  4.5:  Exp  1  versus  Exp  Main  -  External  forces  (x,  y)  from  ankle  to  ground  during 
double  support  phase 
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Periodicity  constraint  is  a  terminal  time  constraint  which  is  only  active  at  the  end  of  the 
time  interval  (t  =  0.486s)  in  Experiment  1.  It  was  observed  that  the  addition  of  periodicity 
constraint  does  improve  periodic  motion  in  the  model,  with  adjustments  in  segment  angular 
displacements  occurring  mainly  during  the  double  support  phase.  In  addition,  it  improved 
the  objective,  keeping  the  y-CoM  closer  to  initial.  Very  small  changes  or  even  sometimes  none 
were  found  in  the  torques  and  horizontal  forces  between  Experiment  Main  and  Experiment 
1 .  The  main  components  which  influenced  periodicity  were  observed  to  be  the  y-component 
forces  acting  on  the  swing  leg  during  double  support  phase,  although  no  changes  or  only 
slight  changes  were  observed  during  the  single  support  phase.  This  is  not  surprising  since 
this  constraint  is  only  active  at  final  time,  and  adjustments  are  only  required  closer  to 
Tf  =  0.486s.  A  larger  vertical  force  observed  is  possibly  due  to  keeping  the  body  upright, 
and  y-CoM  throughout  closer  to  its  initial  position. 
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Figure  4.8:  Exp  1  versus  Exp  Main  -  Comparison  of  velocity  periodicity  differences 


A  periodic  motion  was  achieved  by  improving  angular  periodicity  with  the  implemen¬ 
tation  of  its  constraints,  however  it  comes  with  its  flaws.  Illustrated  in  Figure  4.8,  larger 
velocity  periodicity  difference  values  was  observed  for  Experiment  1  despite  having  angular 
periodic  constraints  in  place.  Having  periodicity  constraints  to  improve  periodic  motion 
may  not  be  ideal  afterall. 


4.2 


Experiment  2 


Experiment  2  investigated  if  a  maximum  velocity  can  be  achieved  by  extending  Experiment 
Main.  This  was  done  by  minimising  time  and  maximising  distance  in  the  objective  function 
using  CPET  (Section  2).  The  minimum  time  of  a  full  walk  cycle  was  calculated  to  be 
t  =  0.386s,  which  is  0.1s  shorter  than  the  original  time  taken  from  the  data.  Figure  4.9 
depicts  the  horizontal  velocity  of  the  CoM,  where  a  faster  velocity  can  be  observed  from 
Experiment  2  as  compared  to  Experiment  Main.  Velocity  increases  towards  the  end  of 
single  support  phase  and  a  sudden  drop  in  velocity  occurs  as  swing  leg  hits  the  ground 
during  the  very  short  double  support  phase. 

The  result  of  Experiment  2  suggests  that  at  faster  walking  speed,  the  ratio  of  single 
support  phase  in  a  full  walk  cycle  is  nearly  one,  which  indicates  that  double  support  phase 
becomes  almost  negligible  or  can  be  considered  instantaneous.  Figure  4.10  illustrated  a  full 
walk  cycle  of  Experiment  2.  It  was  observed  that  the  whole  swing  foot  comes  into  contact 
with  the  ground  at  the  same  time  instead  of  following  a  heel  strike  then  toe  strike  motion. 

The  optimized  joint  torque  trajectories  for  Experiment  2  are  presented  on  Figure  4.11.  As 
these  trajectories  were  originally  derived  from  the  initial  joint  moment  estimates  computed 
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Figure  4.9:  Exp  2  versus  Exp  Main  -  Horizontal  velocity  of  CoM 


Figure  4.10:  One  step  walk  motion  -  Experiment  2 


by  the  method  of  inverse  dynamics,  they  are  specific  to  the  movement  pattern.  As  compared 
to  Experiment  Main,  in  the  single  support  phase,  the  torques  behaved  similarly.  However, 
as  the  double  support  phase  interval  is  much  smaller,  so  are  the  knots  interval  for  torques 
occurring  during  this  period,  hence  torque  changes  rapidly  so  as  to  follow  through  the 
movement  pattern  at  a  quicker  time.  This  set  of  torque  trajectories  is  able  to  produce  the 
same  movement  characteristics  as  in  Experiment  Main,  depicted  by  the  segment  trajectories 
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in  Figure  4.12  but  in  a  faster  time  frame.  In  addition,  the  horizontal  distance  of  CoM  in 
Experiment  2  travelled  a  slightly  further  distance  than  Experiment  Main  in  a  shorter  time 
(Figure  4.13). 


500 


-500 - ! - 1 - ! - 1 - 

0.38  0.4  0.42  0.44  0.46  0.48 


500 


— 50«r - 1 - 1 - 1 - 1 - ■— 

0.38  0.4  0.42  0.44  0.46  0.48 


500 


509 

0.38  0.4  0.42  0.44  0.46  0.48 


0.5 


0.5 


0.5 


>. 

Q. 


0 

-200 


-400 


Exp  1 
Exp  Main 


0.38 


0.4 


0.42 


0.44 


0.46 


0.48 


0.5 


0.38  0.4  0.42  0.44  0.46  0.48  0.5 


500 


-500 - | - - | - 1 - 1 - 1 

0.38  0.4  0.42  0.44  0.46  0.48  0.5 

time(s) 


Figure  4.11: 


Exp  2  versus  Exp  Main  -  optimized  torques 
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Figure  4.12:  Exp  2  versus  Exp  Main  -  Segment,  angular  displacement  trajectories 
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Figure  4.13:  Exp  2  versus  Exp  Main  -  Horizontal  displacement  of  CoM 


Only  trunk  periodicity,  following  Experiment  Main,  was  considered  in  Experiment  2. 
However,  Experiment  2  achieved  better  angular  periodicity  than  Experiment  Main  (Figure 
4.14).  Table  4.2  presents  the  periodicity  difference  values  of  Experiment  Main,  1  and  2, 
where  Experiment  1  considered  periodicity  constraints.  The  values  indicated  that  periodicity 
constraints  need  not  necessary  be  considered  and  periodic  motion  can  be  achieved  in  a  fast 
walk.  Figure  4.15  displays  the  start  and  end  of  the  walk  cycle,  where  a  periodic  motion  can 
be  observed. 
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Figure  4.14:  Exp  2  versus  Exp  Main  -  Angular  periodicity  differences 
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Experiment  1 
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<o 

-0.0178 
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0.0418 

Z5  ~  02  (tf)  ~  77 

-0.0199 

-0.0043 

-0.0533 

za  -  83 (tf)  -  n 

-0.0224 

0.0048 

0.0256 

Z3  ~  +  77 

-0.0613 

-0.2388 

-0.1196 

^2  -  #5 (tf)  +  77 

0.0155 

0.0920 

0.0914 

Zi  -  9e(tf)  +  7r 

-0.0080 

0.0017 

0.0369 

Z7  —  07  (if) 

0.0234 

0.0006 

-0.0859 

Table  4.2:  Angular  periodicity  differences 


meters 

Figure  4.15:  Start  and  end  of  walk  cycle  with  center  of  mass  position  -  Experiment  2 

Table  4.2  provides  a  comparison  of  the  optimized  system  parameters  of  Experiment  2 
and  Experiment  Main  initial  angles  and  angular  velocity.  Differences  of  the  initial  angular 
displacements  can  be  observed  between  Experiment  2  and  Main.  As  maximising  step  length 
was  part  of  the  objective,  system  parameter  Zj  5 ,  defined  to  be  half  step  length,  had  to  be 
maximised.  Adjusting  Z\y,  to  maximise  step  length,  would  change  the  initial  distance  between 
the  stance  toe  and  swing  toe.  As  compared  to  Experiment  Main,  where  z  15  =  0.7351m,  in 
order  to  satisfy  objective,  the  maximum  boundary  of  215 (0.8m)  was  reached  in  Experiment 
2.  Initial  angular  displacements  had  to  be  adjusted  accordingly  as  Z15  changes  in  Experiment 
2,  in  order  for  certain  constraints  to  be  satisfied  such  as,  stance  toe  and  swing  toe  had  to 
be  of  distance  Z15  while  keeping  the  body  upright. 

The  adjustments  in  initial  angles  could  also  be  the  explanation  which  resulted  in  peri¬ 
odicity  being  achieved  as  illustrated  in  Figure  4.15. 

Vertical  displacement  of  CoM  for  Experiment  2  was  observed  to  be  smaller  than  Experi- 
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Parameters 

Experiment  2 

Experiment  Main 

Zl 

2.7055 

2.7055 

Z2 

1.6499 

1.6269 

Z3 

1.8751 

1.8656 

ZA 

4.4185 

4.5093 

z5 

3.8573 

3.8011 

z6 

4.6592 

4.7978 

z? 

1.6927 

1.6411 

Z% 

-1.1615 

-1.2979 

Z9 

-3.1145 

-2.4389 

ZlQ 

-1.6123 

-1.3727 

Zll 

1.7188 

1.6809 

Zl2 

1.2754 

1.1253 

Zl3 

-4.0441 

-4.0952 

Zl4 

0.0571 

0.3605 

Zl  5 

0.8 

0.7351 

Table  4.3:  System  parameters 


ment  Main  (Figure  4.16).  The  difference  between  the  vertical  displacement  of  CoM  through 
the  full  walk  cycle  and  the  initial  position  of  y- CoM  is  lesser  especially  at  final  time  since  a 
periodic  motion  was  achieved. 
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Figure  4.16:  Exp  2  versus  Exp  Main  -  Vertical  displacement  of  CoM 


The  results  of  Experiment  2  suggested  that  at  faster  walk  speed,  single  support  phase 
contributes  to  the  majority  of  the  walk  cycle  and  double  support  phase  may  be  deemed 
instantaneous.  Forces  and  torques  behaved  similar  to  Experiment  Main  except  during  double 
support  phase.  During  double  support  phase,  forces  and  torques  increase  and  decrease  in  the 
same  direction,  but  due  to  the  short  time  interval  present  in  this  phase,  forces  and  torques 
increase  and  decrease  rapidly.  This  rapid  change  allows  the  model  to  reach  final  position  in 
a  shorter  time  interval.  The  final  result  of  Experiment  2  also  suggested  that  periodic  motion 
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can  be  achieved  at  faster  walking  speed  without  the  need  of  periodic  constraints. 


4.3 


Experiment  3 


Experiment  3  investigated  if  a  minimum  velocity  can  be  achieved  by  extending  Experiment 
Main.  This  was  carried  out  by  minimising  distance  and  maximising  time  but  with  a  lower 
bound  on  distance  and  upper  bound  on  time.  These  limits  were  reached.  The  maximum 
time  for  a  full  walk  cycle  was  restricted  to  be  t  =  0.486s  and  step  length  to  be  1.22?n, 
which  is  0.25m  shorter  than  the  original  step  length.  Figure  4.17  presents  the  horizontal 
velocity  of  CoM  of  Experiment  3.  A  slower  velocity  can  be  observed  in  comparison  with 
Experiment  Main  for  single  support  phase,  but  similar  velocity  is  noted  for  double  support 
phase.  However,  the  jump  in  velocity  was  observed  to  occur  earlier  than  t  =  0.386s,  which 
implies  that  the  duration  for  single  support  phase  is  slightly  shorter  in  Experiment  3  than 
Experiment  Main  and  longer  for  double  support  phase.  CPET  (Section  2)  is  used  to  optimise 
the  change  over  time.  Time  interval  for  single  support  phase  in  Experiment  3  ends  earlier 
at  t  =  0.3762,  which  is  0.0098s  shorter  than  Experiment  Main,  which  lengthened  double 
support  phase. 
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Figure  4.17:  Exp  3  versus  Exp  Main  -  Horizontal  velocity  of  CoM 

The  result  suggest  when  minimising  velocity,  even  though  time  to  complete  a  full  cycle 
remains  the  same  t  =  0.486s,  the  duration  of  each  phases  changed  slightly  with  shorter 
step  length  distance.  Since  step  length  is  smaller,  swing  heel  strike  occurred  in  a  shorter 
time  span,  but  duration  of  double  support  phase  was  lengthened  in  order  to  maximise  time 
to  complete  the  walk  cycle.  Figure  4.18  illustrates  the  horizontal  displacement  of  CoM  of 
Experiment  3  which  was  observed  to  be  shorter  as  t  increases  when  compared  to  Experiment 
Main.  Figure  4.19  depicts  the  walk  motion  of  Experiment  3,  where  the  walk  is  observed  to 
be  tighter  and  the  distance  between  the  swing  foot  and  stance  foot  is  smaller. 

The  optimized  joint  torque  trajectories  for  Experiment  3  were  observed  to  have  similar 
pattern  since  it  was  derived  from  initial  joint  moment  estimates  computed  by  inverse  dy¬ 
namics.  However,  due  to  the  change  in  time  interval,  shifts  in  joint  torques  towards  the  left 
was  observed  in  single  support  phase,  and  towards  the  right  in  double  support  phase  (Figure 
4.20  and  4.21).  During  the  single  support  phase,  as  time  interval  was  smaller,  knot  intervals 
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Figure  4.18:  Exp  3  versus  Exp  Main  -  Horizontal  displacement  of  CoM 


Figure  4.19:  One  step  walk  motion  -  Experiment  3 


of  each  control  were  observed  to  be  smaller  in  Experiment  3,  while  in  the  double  support 
phase,  as  time  interval  was  larger,  knot  intervals  of  each  control  were  then  observed  to  be 
bigger.  This  was  to  accommodate  the  shorter  time  span  in  single  support  phase  and  longer 
time  span  in  double  support  phase. 
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Figure  4.20:  Exp  3  versus  Exp  Main  -  optimized  joint  torque  trajectories  during  single  support 
phase 
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Figure  4.21:  Exp  3  versus  Exp  Main  -  optimized  joint  torque  trajectories  during  double 
support  phase 
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Even  forces  in  Experiment  3,  similar  to  joint  torques,  were  observed  to  have  shifted 
according  to  the  changes  of  time  interval  yet  maintaining  a  identical  pattern  as  Experiment 
Main.  Despite  having  same  motion  pattern,  it  was  noticed  that  external  vertical  forces  on 
the  stance  toe  were  smaller  in  Experiment  3,  but  greater  on  the  swing  ankle  at  joint  6,  when 
comparing  with  Experiment  Main  (Figure  4.22).  With  larger  time  interval  in  double  support 
phase,  more  time  was  allowed  for  the  weight  of  the  body  to  be  distributed  from  stance  foot 
to  swing  foot,  as  swing  foot  is  on  the  ground  for  a  longer  duration.  More  force  is  acting  on 
the  swing  foot  as  it  remains  longer  on  the  ground,  while  stance  foot  prepares  to  lift-off. 
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Figure  4.22:  Exp  3  versus  Exp  Main  -  External  forces  on  joint  1  ( xlp ,  yip)  and  joint  6 
(6x:6y)  during  double  support  phase 
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Table  4.4  provides  a  comparison  of  the  optimized  system  parameters  of  Experiment  3  and 
Main.  As  the  objective  was  to  minimise  step  length,  the  lower  boundary  of  215  (=  0.61m) 
was  reached  in  Experiment  3  to  satisfy  the  objective.  Initial  angular  displacements  were 
optimized  to  satisfy  both  objective  and  constraints  of  the  present  experiment,  and  hence  are 
different  from  previous  models. 


Parameters 

Experiment  3 

Experiment  Main 

M 

2.7055 

2.7055 

Z2 

1.6370 

1.6269 

*3 

1.8695 

1.8656 

Z4, 

4.7526 

4.5093 

Z5 

3.7850 

3.8011 

-6 

4.9598 

4.7978 

M 

1.6630 

1.6411 

Z& 

-1.3694 

-1.2979 

~9 

-2.3578 

-2.4389 

Mo 

-1.1421 

-1.3727 

Ml 

1.3291 

1.6809 

M  2 

0.9549 

1.1253 

M3 

-4.1197 

-4.0952 

Zl  4 

0.2092 

0.3605 

^15 

0.61 

0.7351 

Table  4.4:  System  parameters 

Figure  4.23  depicts  the  first  and  last  segment  of  the  walk  cycle  in  Experiment  3.  A 
periodic  motion  could  not  really  be  noticed  in  the  present  experiment  although  the  only 
periodic  constraint  considered  was  on  the  trunk.  A  further  look  at  angular  periodicity  of 
each  segment  was  presented  in  Figure  4.24  and  Table  4.5.  No  improvements  in  periodicity 
could  be  seen  between  Experiment  3  and  Main,  but  Experiment  1  certainly  faired  better  in 
periodicity  since  periodicity  constraints  were  considered. 


Periodicity 

Experiment  3 

Experiment  Main 

Experiment  1 

1 

1 

vO 

0.0383 

-0.2513 

0.0418 

cn 

1 

to 

C-K 

1 

=3 

0.0093 

-0.0043 

-0.0533 

1 

CO 

C"H* 

!-b  ' 

1 

0.0676 

0.0048 

0.0256 

Z3  -  04.(t f)  +  71 

-0.1712 

-0.2388 

-0.1196 

Z2  ~  Offtf)  +  71 

0.2519 

0.0920 

0.0914 

M  -  06(tf)  +  71 

0.0007 

0.0017 

0.0369 

Z7  —  0-j{tf) 

0.0004 

0.0006 

-0.0859 

Table  4.5:  Angular  periodicity  differences 
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Figure  4.23:  Start  and  end  of  walk  cycle  with  center  of  mass  position  -  Experiment  3 
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Figure  4.24:  Exp  3  versus  Exp  Main  -  Angular  periodicity  differences 


Results  of  Experiment  3  suggested  that  at  a  slower  walk  speed,  duration  of  single  support 
phase  was  shortened  and  double  support  phase  was  lengthened.  Forces  and  torques  behaved 
identically  to  Experiment  Main  but  the  knot  intervals  were  shifted  according  to  the  changes 
in  time  interval  of  each  phase.  Unlike  Experiment  2,  periodic  motion  was  no  different  to 
Experiment  Main,  which  suggested  periodicity  could  only  be  achieve  at  higher  walking  speed 
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or  when  periodicity  constraints  were  considered. 


In  this  paper,  we  have  extended  the  mathematical  method  proposed  in  [18]  to  analyse  human 
walking  behaviour  with  the  addition  of  periodicity  constraints  and  at  various  speeds.  The 
mathematical  method  proposed  is  able  to  simulate  normal  walking  motion  of  a  full  walk 
cycle  at  different  speeds,  with  improvement  in  periodicity  seen  with  either  the  addition  of 
periodicity  constraints  or  at  a  faster  walking  speed.  Periodic  motion  simulation  was  improved 
by  our  method  with  the  addition  of  periodicity  constraints  as  identified  in  Experiment  1  or 
at  faster  walking  speed  as  in  Experiment  2.  Experiments  2  and  3  also  show  that  our  method 
allows  velocity  adjustment  in  modelling  walk  motions  at  different  speeds.  A  main  advantage 
of  the  mathematical  method  developed  is  its  ability  to  model  a  walk  motion  in  different, 
more  realistic  scenarios  as  a  single  process,  instead  of  as  multiple  processes. 

Though  the  method  proposed  here  can  model  human  periodical  walking  accurately  and 
effectively,  the  optimal  solutions  arc  usually  sensitive  to  external  disturbances  as  the  model 
proposed  by  us  is  based  on  an  open-loop  optimal  control  approach.  To  model  more  re¬ 
alistic  human  walking  behaviours,  techniques  for  constructing  optimal  feedback  or  robust 
controllers  such  as  those  in  [6,  7,  17,  1,  2,  12]  have  to  be  used  to  yield  optimal  solutions  that 
are  stable  and  robust  in  the  presence  of  internal  and  external  disturbances.  We  will  discuss 
this  in  future  papers. 
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